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ABSTRACT 


Ansah,  Joseph,  A.,  Ph.  D.,  Purdue  University,  August,  1974. 
Destination  Choice  Modelling  and  The  Disaggregate  Analysis  of  Urban 
Travel  Behavior.  Major  Professors:  Gilbert  T.  Satterly  and  K.  C, 
Sxeedharan  Pillai. 


This  research  postulates  a  destination  choice  methodology  that 
can  be  incorporated  in  an  operational  set  of  models  of  urban  travel 
behavior.  The  model  formulation  presented  has  a  prior  provision  for 
making  aggregate  forecasts  of  types  of  travel  behavior  that  the 
current  quality  of  data  can  allow. 

The  belief  that  disaggregate  models  developed  with  traditional 
sampling  designs  require  smaller  samples  than  aggregate  models  is 
not  supported.  This  belief  is  a  result  of  the  lack  of  any  research 
that  has,  at  least  theoretically,  specified  a  framework  for  making 
aggregate  predictions  prior  to  disaggregate  model  development.   It 
is  argued  that  separate  sampling  (analogous  to  stratified  sampling) 
be  used  to  develop  models  without  using  excessive  samples. 

Empirical  analyses  were  carried  out  on  shopping  destination 
choice  in  the  Indianapolis  area,  using  specific  shopping  areas 
instead  of  zones.  A  sample  of  automobile  users  was  used. 

The  multinomial  logit  model  was  found  to  be  impractical  and 
of  low  predictive  power  in  modelling  the  choice  of  specific  shopping 
destinations , 
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A  multinomial  response  relation  model  that  is  proposed, 
indicates  that  it  is  more  pragmatic,  given  the  current  data  quality, 
to  predict  types  of  shopping  destination  choice  behavior.  Attitu- 
dinal  market  segmentation  was  found  to  be  worthwhile  in  model  devel- 
opment. There  was  the  indication  that  only  few  attitudinal  factors 
were  necessary.   Empirical  investigations  with  the  methodology  yielded 
encouraging  results.   Certain  interaction  effects  had  to  be  included 
in  some  of  the  models  developed. 

The  extension  of  destination  choice  modelling  as  a  logistic 
discrimination  problem  is  also  discussed. 


CHAPTER  1 
INTRODUCTION 

In  order  to  have  travel  demand  models  that  are  relevant  and  valid 
in  the  urban  setting,  some  desirable  properties  that  a  suitable  model- 
ling approach  should  include  are: 

1,  The  incorporation  of  transportation,  socio-economic,  and 
activity  system  effects  as  well  as  the  attitudinal  orienta- 
tion of  travellers. 

2,  The  consideration  of  competition  among  choices  at  each 
decision  level. 

3,  A  structure  based  on  behavioral  as  well  as  theoretical  and 
a  priori  grounds . 

4,  Provision  for  the  computation  of  a  valid  equilibrium  between 
demand  and  supply. 

5,  Feasibility  of  statistical  estimation  and  computational 
efficiency. 

That  the  present  urban  transportation  planning  (UTP)  package  of 
travel  demand  models  cannot  meet  most  of  the  above  conditions  is  well 
documented  [1,2,3,4,5,6],*  Both  workers  on  trip  generation  [1,6,7,8, 


♦Numbers  in  square  parenthesis  refer  to  reference  material  listed  at 
the  end  of  the  report. 


9,10]  and  on  modal  split  [11,  12,  13,  14,  15]  have  recommended,  for 
reasons  briefly  given  here,  that  disaggregate  models  be  developed. 
Disaggregate  models  at  one  extreme  analyze  the  travel  behavior  of 
individuals  while  aggregate  models  at  another  extreme  deal  with 
variables  representing  averages  or  totals  for  large  groups  of  entities 
(e.g.,  households  in  a  traffic  zone)  in  their  group's  travel  behavior 
predictive  models.  Since  conventional  trip  generation  models  do  not 
consider  transportation  variables  they  work  on  the  assumption  that 
trip  frequency  is  independent  of  changes  in  the  transportation  system. 
Most  available  large  scale  modal  split  functions,  among  other  short- 
comings, have  either  ignored  all  the  door-to-door  components  of  travel 
time  and  cost  or  they  have  aggregated  them  in  ways  that  limit  the 
applications  of  the  modelling  approach.  The  route  assignment  proce- 
dure tends  to  be  mechanical  since  it  cannot  interact  with  the  demand 
functions  by  virtue  of  the  latter  being  devoid  of  transportation  policy 
variables. 

Manheira  [5]  has  discussed  requirements  to  be  met  by  travel  demand 
models  relating  to  relevance,  validity,  testability,  compatibility 
and  feasibility,   Ben-Akiva  [17]  has  pointed  out  the  need  to  have  a 
behavioral  assumption  on  a  priori  grounds,  relating  to  the  striacture 
of  the  models.  Under  the  relevance  criterion  the  models  should  dis- 
cern differences  among  alternative  transportation  strategies,  recog- 
nize the  existence  of  different  market  segments  and  also  serve  toward 
the  evaluation  of  the  alternative  strategies.  Some  examples  of 
transportation  strategies  whose  impacts  the  model  should  detect  are: 

(i)   Ramp-metering  and  lane  reversal  operations  for  improving 


traffic  flow  and  reducing  the  usage  of  alternative  local 

roads  in  populated  areas, 
(ii)  The  increase  of  parking  prices  at  certain  times  of  the  day 

or  the  restriction  of  automobiles  from  certain  areas  at 

different  times  of  the  day. 
The  above  examples  relate  to  an  auto-oriented  travel  market  seg- 
ment.  Further  classification  may  be  on  the  basis  of  income,  social 
class,  stage  of  life  cycle,  etc.   Several  researchers  have  recommended 
or  used  market  segmentation  in  both  disaggregate  and  aggregate  analyses 
[5,19,20,21,22,23,24].   The  demand  that  models  be  valid  is  not  ques- 
tioned. The  lack  of  sufficient  data  is  often  used  as  an  excuse  for 
the   insufficient  realization  of  validity.  Reasonable  evidence  of 
validity  from  limited  available  data  would  not  only  help  in  justifying 
further  data  collection,  but  the  sampling  can  be  better  designed  if 
the  limited  analysis  has  improved  the  researcher's  understanding  of 
the  system  in  question. 

The  requirement  that  a  travel  demand  model  exhibit  compatibility 
implies  that  the  travel  demand  should  be  allocated  to  the  network  con- 
sistent with  the  services  that  the  transportation  facilities  can  pro- 
vide and  the  travellers'  perception  of  such  services.  The  demand  is 
often  computed  with  a  supply  configuration  assumed.  Though  some  exact 
procedures  for  computing  equilibrium  have  been  proposed,  the  under- 
lying assumptions  and  their  level  of  empirical  testing  have  not 
rendered  them  operational  [25,  26].  A  few  iterative  procedures  are 
available  for  obtaining  an  approximate  equilibrium  in  aggregate 
modelling  [27,  28] . 


It  is  necessary  that  demand  models  be  feasible  and  meet  current 
time,  financial  and  manpower  constraints.   In  the  extreme  situation 
one  would  seek  to  have  causal  models.  The  farthest  causal  analysis 
has  been  taken  in  transportation  planning  is  at  best  explanatory  [10, 
34]  in  its  use  in  studying  factors  that  affect  trip  generation  and  the 
length  of  a  work  trip.  Though  it  has  been  argued  that  observations 
on  human  populations  cannot  be  analyzed  with  the  classical  tools  for 
experimental  studies  [29,30,31,32,33],  improved  statistical  tools  can 
be  used  to  handle  the  limited  size  and  quality  of  available  travel 
survey  data.   If  one  has  to  use  lower  level  (e.g.  probabilittic)  models 
it  has  to  be  ensured  that  their  structure  is  behavioral.   Ben-Akiva  [17] 
has  extensively  discussed  why  a  direct  demand  model  is  behavioral ly 
more  satisfactory.   Computational  feasibility  may  occasionally  require 
that  an  indirect  or  recursive  formulation  be  used. 

Research  Objectives 

In  a  previous  study,  Kannel  [10]  examined  the  trip  generation 
behavior  of  a  relatively  stable  group  of  households  (the  residents 
had  not  relocated  over  a  period  of  seven  years) .  The  household  trip 
generation  regression  equations  developed  could  only  explain  under 
forty  percent  of  the  variation  in  trip  generation.  It  is  unlikely 
that  regression  models  using  similar  variables  for  less  stable  socio- 
economic groups  can  achieve  better  results.  It  is  necessary  that 
other  analytical  tools  be  considered  for  modelling  not  only  trip 
generation,  but  other  travel  decisions  like  destination  and  route 
choices  at  an  operationally  feasible  level.  Such  an  endeavor  would 
require  a  comprehensive  and  costly  data  collection  effort. 


The  development  of  feasible  analytical  tools  must  precede  a  massive 
expenditure  of  manpower  and  fiscal  resources.   Research  work  using 
present  data  with  some  promise  for  shedding  light  on  useful  analyti- 
cal techniques  can,  therefore,  be  a  very  valuable  guide  to  decision 
makers  with  regard  to  future  directions  in  travel  demand  modelling. 
The  principal  concern  of  this  research  is  the  development  of  an 
operational  framework  for  modelling  urban  shopping  destination  choice 
behavior. 

Current  disaggregate  approaches  are  so  clouded  by  mode  choice 
considerations  that  it  is  hard  to  tell  their  effectiveness  in  predict- 
ing destination  choices  directly.  Arbitrarily  selected  sets  of 
alternative  shopping  destination  alternatives  are  used  for  individ- 
uals, a  procedure  that  is  impractical  on  a  large  scale.   Provided 
there  is  information  on  the  geographical  coordinates  of  households 
and  shopping  area  locations,  even  at  the  sub-zonal  centroid  level, 
airline  distances  and  shopping  area  attraction  measures  can  be  used 
to  develop  a  more  consistent  procedure  for  selecting  shopping  destin- 
ation choice  sets.  Notwithstanding  the  choice  set  definition  problem 
for  spatial  choices,  the  aggregation  problems  associated  with  determin- 
ing aggregate  predictions  from  the  current  formulation  of  disaggregate 
models  is  mainly  a  result  of  the  failure  to  specify  an  aggregation 
methodology  prior  to  disaggregate  model  development.  This  research 
effort  uses  functional  shopping  area  classifications  of  conceptual 
destination  choice  alternatives  in  model  development. 

One  major  expectation  in  disaggregate  modelling  is  a  saving  in 
data  collection  in  operational  applications.  Disaggregate  models,  in 


addition,  attempt  to  consider  as  many  travel  market  segments  as  possi- 
ble. A  random  sample  is  likely  to  under-represent  some  of  the  less 
common  market  segments.  With  a  conscious  effort  to  represent  each 
market  segment,  a  suitable  sampling  scheme,  tied  to  the  modelling 
theory,  can  be  utilized  to  cover  all  the  relevant  market  segments  with- 
out making  the  total  sample  size  prohibitive. 
The  major  objectives  of  this  study  were: 

1.  To  investigate  the  modelling  of  shopping  destination  choices 
for  an  essentially  auto-committed  group  of  households  at  the 
disaggregate  level. 

2.  To  attempt  to  incorporate  attitudinal  indices  in  the  model- 
ling of  types  of  shopping  destination  choice. 

3.  To  determine, from  the  estimation  and  application  problems 
encountered  in  the  study,  how  data  can  be  efficiently  col- 
lected and  analyzed  for  operational  transportation  planning. 


Organization  and  Summary 

Chapter  2  discusses  the  problem  of  destination  choice  modelling 
in  a  spatial  context,  unclouded  by  mode  choice  effects.  The  issue  of 
destination  choice  set  definition  is  considered  with  the  proposal  that 
a  functional  destination  classification  be  adopted  in  order  to  devel- 
op practical  travel  behavior  models.  The  aggregation  problem  is 
discussed.   Some  inconsistencies  and  possible  analytical  problems  in 
some  recently  proposed  approaches  are  also  discussed.   It  is  maintain- 
ed that  a  framework,  not  necessarily  operational  because  of  data 


collection  problems,  for  aggregation  should  be  a  prerequisite  for  the 
development  of  a  compatible  disaggregate  model.  With  the  qualitative 
and  categorical  nature  of  most  of  the  data  currently  available  it  is 
proposed  that  an  experimental  design-based  approach  be  used  to  set  up 
a  modelling  framework  that  simultaneously  considers  different  travel 
segments  in  a  given  choice  situation.  Some  methodological  implications 
of  introducing  attitudiaal  factors  into  the  disaggregate  modelling 
scenario  is  also  discussed. 

Chapter  3  briefly  introduces  the  disaggregate  modelling  approach 
highlighted  by  recent  research  work  [13,16,17].  Some  underlying 
assumptions  in  model  parameter  estimation  and  their  interpretation 
are  discussed. 

Chapter  4  presents  the  results  of  empirical  analysis  based  on  the 
theory  and  concepts  discussed  in  Chapter  3. 

A  detailed  justification  is  given  in  Chapter  5  to  the  concept  of 
functional  destinations  introduced  in  Chapter  2.  A  Multinomial  Re- 
sponse Relation  methodology  is  presented  that  accounts  for  the  differ- 
ent travel  market  segments  and  their  response  to  travel  choice  alterna- 
tives. 

In  the  context  of  a  common  choice  set  for  all  individuals,  an 
equivalence  is  readily  established  between  the  choice  context  multi- 
nomial logit  modelling  approach  and  a  recently  developed  logistic 
discrimination  approach. 

A  major  advantage  of  the  multinomial  response  relation  approach 
is  the  ability  provided  for  the  investigation  of  interaction  among 
qualitative  variables  that  classify  travel  market  segments  or  travel 
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choice  categories.  The  classifications  can  be  revised  to  minimize 
such  interactions.   Estimates  of  travel  market  segment  shares  of  travel 
behavioral  responses  can  be  obtained  from  a  mail  or  telephone  survey. 
These  estimated  proportions  can  then  be  used  in  a  separate  sampling 
logistic  discrimination  that  is  eventually  proposed  as  a  procedure  to 
be  adopted  in  an  operational  transportation  study. 

The  theory  behind  separate  sampling  logistic  discrimination  is 
elaborated  on  and  its  discriminant  function  estimation  is  related  to 
that  of  the  mixed  distribution  unconstrained  discrimination  situation. 

Approaches  for  including  attitudinal  information  in  the  market 
segmentation  are  presented  in  Chapter  6.   By  virtue  of  the  nature  of 
the  information  on  attitudes  to  hypothetical  shopping  areas  (See 
Figure  6.1)  no  extensive  scaling  is  possible.  With  the  nature  of  the 
data  not  allowing  the  use  of  classical  successive  category  analytical 
techniques,  contingency  table  methods  are  used  to  organize  the  avail- 
able attitudinal  data  prior  to  its  use  in  market  segmentation  for 
multinomial  response  relation  analysis. 

Chapter  7  gives  the  empirical  investigation  with  the  methodologies 
developed  in  Chapter  5  and  6.  Models  fpr  various  market  segments  are 
discussed.  Some  multivariate  testing  is  conducted  on  the  need  for 
attitudinal  market  segmentation. 

Chapter  8  draws  conclusions  from  the  study.  The  major  proposals 
are: 

1,   The  adoption  of  a  systematic  definition  of  travel  choice  sets, 
considering  common  functional  destination  alternatives. 


The  use  of  a  multinomial  response  approach  as  a  vehicle  for 
developing  an  aggregation  scheme  linked  with  an  appropriate 
sample  survey  design. 

The  separate  sampling  of  travel  market  segments  at  activity- 
market  segment  interfaces  and  the  recommendation  that 
disaggregate  analysis  be  considered  in  the  framework  of 
separate  sampling  logistic  discrimination. 
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CHAPTER  2 
DESTINATION  CHOICE  MODELLING  FOR 
PRIMARY  URBAN  TRAVEL  BEHAVIOR 

Destination  Choice  Unclouded  by  Mode  Choice 

The  viewing  of  urban  travel  behavior  in  a  choice  framework  instead 
of  the  traditional  demand  approach  can  be  taken  as  the  current  state  of 
the  art  in  disaggregate  modelling.  The  urban  traveller  has  been  con- 
sidered as  a  consumer  choosing  among  alternative  trip  frequencies, 
periods  of  day,  modes  of  travel,  destinations,  and  routes  at  different 
decision  levels  or  simultaneously.  The  dominance  of  mode  choice 
considerations  in  most  of  such  traveller  choice  considerations  in  most 
of  such  traveller  choice  models  [13,  16,  17,  20,  41,  42]  is  readily 
detected.  The  spatial  context  of  destination  or  route  choice  behavior 
warrants  a  treatment  that  has  not  been  operationally  implicit  in 
current  methods  of  disaggregate  analysis . 

Spatial  choices  of  destinations  and  routes  at  a  micro  level 
involve  different  specific  choice  sets  for  individuals  depending  on 
their  geographical  location  and  the  time  of  day.  The  attraction 
measures  of  spatial  opportunity  sets  also  involve  a  lot  of  factors 
most  of  which  are  qualitative.  It  is  the  willingness  of  current 
disaggregate  modelling  approaches  [16,  17]  to  use  arbitrarily  selected 
specific  but  gross  destination  choice  sets  of  different  sizes  for 
different  individauls  that  poses  intractable  problems  in  terms  of 
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operational  practicability  and  analytical  feasibility.   It  is  quite 
easy  for  one  to  get  carried  away  by  endless  theoretical  excursions 
in  travel  behavior  analysis  and  over-generalize  the  models,  losing 
sight  of  the  ultimate  objectives  of  urban  transportation  planning.  An 
example  is  the  recommendation  that  models  include  linked  trips,  the 
variation  of  destination  choice  sets  with  time  of  day  and  various 
considerations  that  arise  in  local  travel,  when  one  is  still  unable 
to  adequately  analyze  the  one-way  travel  from  the  home  to  the  first 
destination.  The  functional  classification  in  an  urban  highway 
network  should  be  instructive  here.  Transportation  engineers  exper- 
ience greater  congestion  problems  with  higher  class  roads:  facilities 
like  freeways,  expressways  and  major  arterial  streets.  Travel  on 
local  streets  is  not  as  critical  as  that  on  some  higher  type  facilities. 
Even  on  the  arterial  network  it  is  the  traffic  movement  in  major 
corridors  that  is  of  major  interest.  Problems  involving  local 
travel  and  sections  of  arterial  streets  where  traffic  volumes  taper 
off  in  magnitude  fall  within  the  capability  of  traffic  operations  and 
need  not  be  considered  by  travel  behavior  models  until  travel  on  the 
primary  arterial  network  has  been  adequately  analyzed. 

It  is  the  issues  briefly  cited  in  this  section  and  the  need  to 
develop  interim  or  "compromising"  operational  models  of  urban  travel 
behavior  that  will  dictate  the  choice  of  methodology  to  be  presented 
in  this  report.  The  concentration  on  destination  choice  is  only  to 
allow  a  detailed  consideration  of  the  ramifications  in  the  analysis 
of  this  spatial  travel  choice  so  that  it  can  be  more  effectively 
incorporated  in  a  model  considering  all  relevant  travel  choices. 


12 


The  choice  of  the  shopping  trip  purpose  is  not  only  a  result  of  the 
nature  of  available  data  but  also  because  shopping  travel  is  next  in 
importance  to  work  travel  in  many  urban  areas. 

Destination  Choice  Set  Definition 

An  immediate  problem  that  arises  in  modelling  destination  choices 
is  that  of  defining  the  sets  of  alternatives  to  be  considered  by 
different  individuals  or  groups  of  people.  Gravity,  intervening 
opportunity,  or  more  general  entropy  models  as  well  as  utility  models 
of  interzonal  trip  distribution  [55,  56,  57,  58,  59]  assume  a  common 
set  of  physical  destinations  for  all  individuals  i.e.  each  individual 
within  a  given  zone  is  said  to  consider  every  other  zone  in  the  city 
as  a  potential  destination.  Notwithstanding  the  gross  measure  of 
attraction  implied  in  considering  zones  as  destinations,  some  destin- 
ations are  more  likely  to  be  selected  than  others.   If  the  different 
dimensions  of  attraction  of  activity  centers  were  to  be  fully  consid- 
ered the  total  number  of  distinct  alternatives  would  be  prohibitive 
even  for  aggregate  trip  distribution  procedures.  The  claim  that 
groups  of  people  in  zones  consider  zones  as  destination  alternatives 
is  more  parsimonious  than  the  expectation  that  individuals  considered 
in  disaggregate  models  [16,  17]  consider  zones  or  even  districts  as 
alternative  destinations.  The  rejection  of  the  assumption  that  indivi- 
duals have  a  common  physical  destination  choice  set  is  implied  in  work 
done  by  urban  geographers  in  the  individual's  cognition  and  information 
field  [60,  61]  and  by  currently  proposed  disaggregate  modelling  tech- 
niques [16,  17] . 
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It  is  quite  inconsistent  to  advocate  disaggregate  analysis  at 
the  behavioral  unit  -  the  individual  or  household  -  and  yet  grossly 
aggregate  the  specific  destinations  that  individuals  consider.  Urban 
travel  related  disaggregate  destination  choice  modelling  has  so  far 
been  confined  to  samples  taken  from  just  one  or  two  geographic  sectors 
of  a  city  for  which  the  set  of  zonal  shopping  destinations  is  arbitra- 
rily limited.  The  arbitrary  selection  of  alternative  destination  sets  , 
even  in  the  form  of  zones  using  zonal  retail  employment  as  the  attrac- 
tion measure* is  a  very  tedious  process  even  when  considering  a  couple 
of  sectors.   It  is  operationally  impractical  to  attempt  to  suggest 
such  a  procedure  for  a  larger  number  of  individuals  located  all  over 
a  given  metropolitan  area.   If  the  different  retail  activities  and  the 
multiple  classifications  of  retail  center  types  were  to  be  considered  , 
the  situation  becomes  exceedingly  complex.  The  alternative  shopping 
destinations  faced  by  individuals  depend  on  the  location  of  the 
individual  relative  to  the  total  set  of  alternatives  in  the  urban  area 
at  a  given  time  of  day,  the  socio-economic  class,  the  attitudinal 
orientation  of  the  individual,  and  the  categories  of  destinations  more 
likely  to  fall  within  the  individual's  consideration.  Other  factors 
include  the  familiarity  of  the  individual  with  the  urban  area  [51,  60, 
61]  as  well  as  the  cost  of  travel  and  ease  of  travel  to  each  shopping 
area.   The  present  state  of  knowledge  about  destination  choice  behav- 
ior and  the  data  collection  and  analytical  problems  cannot  permit  an 
adequate  consideration  of  all  the  above  factors  in  defining  destination 
choice  sets  for  an  operational  model  for  destination  choices  for 
primary  urban  travel. 
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To  operationalize  the  choice  set  definition  for  individuals  under 
present  circumstances  the  use  of  some  approximate  and  indirect  techni- 
ques is  inevitable.  Hanson  [61]  suggested  the  consideration  of  grocery 
stores  falling  into  distance  rings  centered  on  the  individual's 
location.  This  approach  is,  however,  too  simplified  for  urban  trans- 
portation planning.   It  is  possible  to  use  some  concepts  in  information 
theory  [105,106,107]  to  bound  choice  sets  for  individuals  given  suit- 
able data.  The  main  idea  is  to  select  shopping  areas  that  are  likely 
to  induce  primary  urban  travel  (i.e.  travel  that  v/ill  use  major 
arterials  of  interest  to  transportation  engineers) .  Having  identified 
such  shopping  areas  they  are  classified  from  the  point  of  view  of  each 
individual  household's  relative  location,  the  attraction  measures  for 
the  shopping  areas  and  the  ease  of  approach  to  the  latter  from 
different  directions.  A  relative  entropy  computed  from  each  indivi- 
dual's deduced  classification  table  determines  the  degree  to  which 
different  categories  of  shopping  areas  are  readily  available  to  the 
individual.   Depending  on  whether  the  relative  entropy  approaches 
unity  or  zero,  the  individual  is  likely  to  have  a  large  or  small 
choice  set  of  shopping  areas. 

A  way  to  get  around  the  individual  destination  choice  set  defini- 
tion problem  is  to  develop  functional  classifications  of  destinations 
so  that  each  individual  or  market  segment  faces  the  same  set  of 
functional  categories  of  destinations.   Factors  that  can  be  used 
operationally  for  shopping  area  categorization  with  respect  to  auto 
travel  may  include  airline  distance,  level  of  free  parking  availability, 
the  number  of  retail  outlets,  and  the  ease  of  shopping  area  approach 
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from  different  directions.  Even  the  functional  classification  of 
shopping  areas  on  two  or  three  of  these  factors  is  not  a  trivial 
matter.  The  limits  of  categorization  of  the  essentially  qualitative 
factors  involved  cannot  be  so  easily  determined.  It  is  possible 
to  test  for  independence  among  the  ways  of  classification  in  order  to 
achieve  mutually  exclusive  functional  classifications.   If  several 
interactions  are  detected  it  will  be  necessary  to  revise  the  ways  of 
categorization  or  to  transform  the  definitions  of  the  factors  used  for 
the  classification.  The  specific  set  of  physical  alternatives  that 
fall  into  the  functional  destination  category  that  an  individual  or 
market  segment  is  predicted  to  select^ can  be  identified  relative  to 
the  individual's  location.   For  example,  if  the  functional  category 
is  associated  with  medium  airline  distance  from  home,  a  high  level  of 
free  parking,  and  a  wide  availability  of  different  retail  outlets, 
the  physical  set  of  destinations  for  an  individual  includes  all 
shopping  areas  within  a  medium  airline  distance  range  with  the 
corresponding  levels  of  free  parking  and  availability  of  retail  out- 
lets. Computer  simulation  can  be  used  to  further  assign  individuals  to 
specific  physical  destinations  in  the  absence  of  a  better  procedure 
for  doing  so  [65,  80].  The  sample  size  needed  to  conduct  disaggregate 
modelling  using  such  functional  destinations  may  not  be  as  small  as 
many  researchers  have  been  hoping,  but,  more  importantly,  this  approach 
better  prepares  the  travel  behavior  analyst  to  deal  with  a  problem 
like  aggregation  more  effectively.  The  functional  destination  approach 
makes  the  analysis  of  destination  choices  more  systematic,  making 
possible  the  adaptation  of  statistical  methods  like  multinomial 
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response  relation  analysis  [66,  67]  for  the  analysis  of  market  segments 
that  can  be  identified  in  the  sampling  sense  as  different  strata  for 
which  a  travel  survey  has  to  be  designed.   It  will  be  further  proposed 
in  this  research  that  the  different  travel  market  segments  should  be 
sampled  separately^ preferably  at  activity  centers.  This,  in  principle, 
is  not  a  new  idea  in  transportation  planning  [68,  69,  70] .  A  familiar 
disadvantage  in  the  use  of  a  single  sample  involving  a  mixture  of 
market  segments  is  the  attempt  to  do  modal  choice  analysis  in  a  travel 
survey  only  to  find  out  that  a  negligible  percentage  of  the  sample  were 
in  a  position  to  trade  between  modes  of  travel.  The  contribution  of 
this  study  will  be  in  the  presentation  of  a  statistical  methodology 
that  handles  discrimination  with  separate  sampling.  The  use  of  multi- 
nomial response  relation  approach,  using  a  single  sample  from  old 
survey  data  or  a  mail  survey  provides  estimates  of  separate  sampling 
population  proportions  needed  in  the  separate  sampling  discrimination 
approach.   It  will  be  shown  in  Chapter  5  that  in  the  framework  of  a 
common  set  of  functional  destinations, a  multinomial  choice  model  is 
equivalent  to  a  multi-population  discrimination  problem.  It  is  the 
availability  of  theoretical  work  to  extend  the  mixed  distribution 
sampling  discrimination  estimation  problem  to  the  separate  sampling 
situation  that  dictates  the  preference  for  viewing  the  problem  as 
one  in  statistical  discrimination. 

Model  Development  With  Prior  Consideration  of  Aggregation 

In  order  to  make  aggregate  forecasts  of  travel  behavior, a  means 
of  accomplishing  this  must  be  developed  whether  one  starts  with  an 
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aggregate  or  disaggregate  set  of  models  for  a  given  travel  survey- 
sample  data.  Several  problems  that  can  be  encountered  in  the  process 
have  been  discussed  by  Talvitte  [71] .  Only  the  determination  of 
aggregate  forecasts  from  disaggregate  models  will  be  discussed  in  this 
section.  Recent  workers  on  such  aggregation  [71,  72]  have  proposed 
the  development  of  an  aggregate  forecasting  model  from  a  disaggregate 
model,  which  is  an  easy  matter  if  linearity  prevails.  However,  making 
aggregate  forecasts  can  be  considered  a  complex  sampling  problem  where 
disaggregate  models  are  used  for  prediction  within  a  given  sample  design 
based  on  a  rigorous  experimental  design  classification  scheme,  both  for 
the  travel  market  segments  and  travel  choices.  The  sampling  approach 
to  aggregation  will  be  recommended,  particularly  for  spatial  choices. 

In  a  multiple  choice  situation  the  probability  of  choice  of  an 
alternative  can  be  expressed  as  [36] : 

^^  ^  \i      gCx.  1  2.1 

Z  e    ^• 
1=1 

where  P.,  =  the  probability  that  an  individual  i  chooses  alternative  k 

out  of  a  total  of  K  alternatives. 

g(x.,)  =  a  function  Qisually  assumed  linear)  of  the  variables  or  factors 
for  alternative  1  that  affect  the  choice  decision.  This 
function  has  been  called  the  utility  function.  Though  the 
author  prefers  to  call  it  a  discriminant  function,  the  name 
"utility"  will  be  used  only  to  maintain  a  uniformity  of 
terminology. 
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Assuming  that  all  individuals  within  a  given  travel  market  segment 
of  size  M  attach  the  same  importance  to  rationalizing  the  decision, 
as  the  model  implies,  the  likelihood  of  each  individual  behaving 
rationally  is  1/M.  Using  a  Taylor's  series  expansion  up  to  second 
order  terms  on  the  individual's  cumulative  distribution  function  in 
equation  (1),  Talvitte  suggested  that  the  aggregate  share  of  the 
alternative  k  be  expressed  as  [71]  : 

k 

L 
1  =  1 


1, 

K   =  \f^  *   ^  Var(gCxi))-CPi  -  e]-CPi  -  j)] 


where  6=1    when  t  =  k 
6=0     otherwise 

Pfc  -  aggregate  share  of  alternate  k  in  a  market  segment 
Pj^  =  value  of  Pj^  evaluated  at  the  mean  of  G(x)  , 

Pj^  and  G(x)  being  general  expressions. 

Pj^j^  and  gC^-j^)  in  equation  2.1  are  values  taken  by  P.  and  GCx) 

respectively  for  individual  i. 

VarCgCx^))  =  variance  of  the  utility  function  in  the  market  for 

alternative  1 

X  in  gCx)  is  a  vector  of  socio-economic,  level  of  service  and 

activity  system  variables. 

The  evaluation  of  the  mean  and  variance  of  the  utility  function  is 
a  difficult  problem,  as  has  been  discussed  by  Talvitte  [71] .  Essenti- 
ally, assuming  activity  system  variables  to  be  constant,  Talvitte's 
discussion  assumes  the  canonical  correlation  between  the  set  of 
socio-economic  variables  and  the  set  of  level  of  service  variables  is 
negligible,  leaving  room  for  the  possibility  of  correlation  among 
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variables  within  either  set.  The  choice  model  of  equation  2.1  must 
be  the  first  to  consider  the  existence  of  correlation  or  interaction 
among  variables  or  factors.  Purely-  analytical  approaches,  even 
assiiining  all  the  components  of  x  are  continous  variables  with  a 
multivariate  distribution, can  easily  get  out  of  hand, even  if  one  uses 
well-developed  multivariate  normal  distribution  theory  [16,  73,  74], 
this  being  the  very  reason  why  the  multimonial  logit  model  was  postu- 
lated. A  further  point  of  observation  is  that  it  is  only  in  mode 
choice  related  situations  that  one  has  a  respectable  number  of  independ- 
ent variables  that  condition  choice  behavior.  With  the  introduction  of 
attitudinal  orientation  into  the  mode  choice  analysis, an  important 
qualitative  dimension  is  introduced.  Purely  analytical  approaches  to 
the  aggregation  problem  cannot  handle  situations  where  qualitative 
factors  conditioning  choice  behavior  cannot  be  ignored. 

Within  the  framework  of  disaggregate  analysis  suggested  by  recent 
works  that  have  considered  spatial  choices,  one  would  have  to  look  for 
sub-market  segments  that  have  the  same  set  of  spatial  alternatives 
before  an  analytical  aggregation  scheme  can  be  used.  This  approach  is 
only  a  step  away  from  trying  to  predict  the  choice  of  each  household 
or  block  of  households, which  is  obviously  an  impractical  task. 

It  has  been  suggested  [75]  that  building  separate  models  for 
different  market  segments  can  help  resolve  some  of  the  aggregation 
problems .  This  requires  a  rigorous  definition  of  market  segments 
with  both  homogeneous  population  characteristics  and  travel  behavioral 
and  attitudinal  orientations. 
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Some    difficulties  and  possible  inconsistericies  in  analytical 
approaches  to  the  aggregation  problem  have  been  discussed  above.   The 
predominance  of  categorical  and  qualitative  factors  in  travel  choice 
behavior,  particularly  for  spatial  choice,  and  the  requirement  for 
a  rigorous  definition  of  homogeneous  categories  of  travel  market 
segments,  further  complicate  the  aggregation  problem.  A  reasonable 
approach  is  an  experimental  design  method  that  provides  a  link  to  a 
complex  sampling  design  that  can  be  used  for  aggregate  predictions  of 
distinct  types  of  travel  behavior.   This  is  the  basis  for  recommending 
the  multinomial  response  relation  procedure  mentioned  earlier  in  this 
chapter  and  presented  in  detail  in  Chapter  5.  A  major  requirement 
of  such  a  statistical  approach  is  that  one  rigorously  define  the  v/ays 
of  classification  of  the  rravel  market  and  a  systematic  specification 
of  direct  alternatives,  like  modes,  and  conceptual  spatial  alternatives, 
like  functional  shopping  destinations,  discussed  in  the  previous  sec- 
tion. Drawing  an  analogy  with  methods  like  factor  analysis,  multi- 
dimensional scaling  and  principal  component  analysis  for  reducing 
the  dimensionality  of  problems  in  many  fields  [49,  54,  76,  77],  the 
concept  of  functional  spatial  alternatives  should  not  be  too  hard  to 
accept. 

Though  this  research  was  not  concerned  with  aggregation  models^ the 
above  discussion  has  been  necessitated  by  the  basic  requirement  that 
disaggregate  models  be  tied  with  a  suitable  aggregation  framework. 

The  Use  and  Methodological  Implications  of  Attitudinal  Data 

The  need  to  add  attitudinal  data  to  socio-economic,  transportation 
and  activity  system  variables  in  analyzing  travel  behavior  is  well 
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documented  [10,  21,  23,  43,  44,  45,  46].  Two  questions  that  may  be 
asked  relate  to  how  attitudinal  data  should  be  used  and  whether  they 
are  important  separately  or  in  combination  with  other  variables  and 
factors. 

There  has  been  the  use  of  attitudinal  indices  for  the  definition 
of  utility  "functions"  for  groups  that  individuals  may  belong  to, 
relative  to  a  given  item  of  travel  behavior  [23,  45,  47],  The  work 
along  these  lines  has  been  based  either  explicitly  or  indirectly  on 
Lancaster's  thesis  [48],  that  it  is  the  intrinsic  properties  of  a  good 
that  give  satisfaction  and  not  the  good  per  se.  The  implications  of 
this  in  modal  choice  has  been  discussed  by  Allen  and  Isserman  [23] . 
The  assumption  is  made  that  an  individual  behaves  rationally  in  that 
he  maximizes  the  utility  associated  with  the  attributes,  physical  or 
subjective,  posed  by  transportation  alternatives.  Compared  with  the 
substantial  work  done  on  attitudinal  and  perception  considerations  in 
modal  choice  behavior  [10,  21,  23,  43,  44,  78,  79],  similar  work  on 
destination  choice  has  been  mostly  limited  [75] .  Making  reference 
to  others*  work,  Burnett [75]  points  out  that  "socio-economic  character- 
istics do  not  appear  to  be  highly  correlated  with  cognition,  prefer- 
ence formation  and  overt  choice  behavior".  There  is  the  need  to 
incorporate  the  individual's  perception  or  attitude  towards  travel 
choices  directly.  Recent  work  on  the  perception  of  shopping  places 
[75,  61]  indicates  that  very  few  complex  attributes  are  used  by 
individuals  to  assess  alternatives .  These  attributes  do  not  appear  to 
bear  any  clear  relationship  to  the  size  and  distance  variables  assumed 
to  be  important  by  traditional  models  of  destination  choice  [58,  81]. 
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Some  recently  developed  multi-dimensional  scaling  techniques 
have  been  applied  in  some  studies  of  travel  behavior  £54,  77J , 
Their  appeal  is  supposed  to  be  in  the  hope  that  they  can  provide  a 
monotonic  scale  of  measurement  that  could  be  used  to  predict  individual 
preferences.  These  multi-dimensional  scaling  procedures  are  not  only 
expensive  and  difficult  to  implement,  but  the  nature  of  their  data 
requirements  can  be  too  involved  in  the  context  of  operational  travel 
surveys  even  for  short  range  purposes.  Considerable  effort  will  be 
needed  to  develop  a  methodology  appropriate  for  different  kinds  of 
travel  decisions  [51,  53,  82,  84].   It  is  appropriate,  at  least  for 
the  preliminary  work  on  attitudes  on  primary  shopping  travel,  to  use 
simpler  uni-dimensional  scaling  techniques  [10,  85,  86]  as  a  means  of 
introducing  attitudinal  data  in  the  analysis  of  destination  choice. 
A  contingency  approach  for  attitudinal  orientation  pattern  prediction 
is  discussed  in  Chapter  6. 
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CHAPTER  3 
GENERALIZED  PROBABILISTIC  CHOICE  MODELLING 

Much  of  the  recent  work  on  disaggregate  modelling  of  travel  choices 
[13,  16,  17]  has  been  based  on  probabilistic  choice  theory.  The  theory 
is  generalized  in  the  sense  that  physically  distinct  choice  sets  of 
different  sizes  have  been  considered  for  individuals.  The  foundations 
of  probabilistic  choice  theory  are  in  the  works  of  Luce,  Suppes,  Arrow 
and  Marschak  and  it  has  been  discussed  in  recent  travel-choice  behavior 
literature  [39,  87].   Some  of  the  following  discussion  that  introduces 
some  aspects  of  probabilistic  choice  theory  is  drawn  from  a  presentation 
by  Hansen  [88] . 

Probabilistic  Choice  Theory 

Luce's  basic  choice  axiom  defines  T  of  a  finite  subset  of  V  such 
that,  for  every  S  C  T,  P  is  defined. 

(i)   If  Pj^y(x,y)  J^  0,1  for  all  x,y  G  T,  then  P^(R)  =  Pg(R)-P^(S). 

(ii)  If  P   (x,y)  =  0  for  some  x,y  G  T,  then  for  every  S  C  T 
xy 

V^)  =PT-(x)t^-W)- 
In  the  above,  A  C  C  implies  A  is  a  subset  of  C.   Subscripts  refer 

to  the  set  for  which  the  probability  is  defined.  The  notation  P   (x,y) 

xy 

is  the  probability  that  x  is  chosen  over  y. 

The  Luce  model  can  be  shown  to  imply  the  existence  of  a  ratio 
scale  that  is  unique  up  to  a  positive  scalar  multiplication  and  is 
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independent  of  any  assumptions  about  the  structure  of  the  set  of  alter- 
natives when  one  is  confined  to  a  local  region  T  in  V.  The  Existence 
Theorem  [39]  is  as  follows: 

"Suppose  that  T  is  a  finite  subset  of  V,  that  P   (x,y)  j^  0,1  for 
all  X,  y  €  T,  and  that  axiom  1  holds  for  T  and  its  subsets,  then  there 
exists  a  positive  real-valued  function  U  on  T,  which  is  unique  up  to 
multiplication  by  a  positive  constant,  such  that  for  every  SC  T 

P  (X)  =  -i^W_  „ 


s^"^    i:  U(y) 
yes 

The  existence  theorem  deals  with  each  subset  in  V  in  which  pairwise 
discriminations  are  imperfect  (i.e.  all  P(x,y)  /  0,1).  Supposing  R 
and  S  are  two  overlapping  subsets  of  V  over  which  U-functions  are 
defined.  Arbitrary  scale  constants  can  be  chosen  so  that  a  single 
scale  will  prevail  over  the  overlap  region.   Luce  [39]  has  given  suffi- 
cient conditions  for  merging  local  U-scales  together  to  form  a  single 
scale  over  the  whole  set  V. 


(2^ 
According  to  Marschak  [87]  "U^   ,  a  real-valued  random  function  on 

called  ; 

every  x  ^  y. 


X  is  called  a  'random  utility  indicator'  in  the  binary  sense  if  for 


PCU^^-"  >  U^^^)  =  P   (x,y)" 
*■  x   —  y      xy^  '•''^ 

Special  random  utility  indicators  are  introduced  by  Marschak 
[87]  in  the  following  decreasing  order  of  generality: 

-  A  real-valued  function  w  on  x  is  called  a  weak  utility  function 

if  w  >  w  when  and  only  when  P   fx,y)  >  ■:;■  3.1 

X—  y  xy—  2 

-  A  real-valued  function  V  on  X  is  called  a, strong  utility  function 
if  there  exists  a  monotone  increasing  function  *  such  that  (|)  (V  -  V  ) 

=  Pxy*^""'^^'  *v^°^  "I  ^-^ 

-  A  positive-valued  function  U  on  X  is  called  a  strict  utility 
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U 
function  in  the  binary  sense  if  P   (x,y)  =  - — rp  3.3 

X  y 

In  this  sense.  Luce's  [39]  U-functions  are  "strict  utility  functions". 
Assuming  that  conditions  for  random  and  strong  utility  functions 

are  satisfied,  a  random  function  e  can  be  defined  as  a  random  compo- 

(2) 
nent  of  U    as  follows: 

X 

U^^^  =  V  +  e  where  E(e  )  =  0  3.4 

X       XX  X 

The  random  e  's  are  assumed  to  be  independently  distributed  and 
to  contain  the  effects  of  the  choice  situation  upon  utility  which  one 
is  unable  to  measure.   This  results  in  the  following  binary  choice 
probability  of  alternative  x 

P   (X,Y)  =P   (V  +e  >V  +e) 
xy        xy  X    X    y    y 

=  P   (e  -  e  <  V  -  V  )  3.5 

xy  y    X    X    y 

Attention  will  now  be  focussed  on  the  distribution  of  the 

random  variables  e  and  e  as  well  as  e*  =  e  -  e  . 

y      X  y    X 

With  V  and  V  linear  in  their  unknown  parameters,  it  has  been 
X      y  r  ' 

shown  that  a  variety  of  probability  distribution  functions  (p.d.f.)  are 
consistent  with  the  underlying  random  model  of  individual  utility 
maximization  for  binary  choice  [88]  .  The  most  useful  distribution  has 
been  the  logistic  distribution  [16] .   This  is  obtained  when: 

-t 

PfE.  <  t)  =  e~® 
^  1  — 

f(e.),  the  p.d.f.  of  e.  is  given  by 

_  -£. 

f (e. )  =  e  i  e  3.6 

-OO   <   £  .    <   OO 
—    1   — 
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The  following  relationships  are  defined: 


e  =  e  -  e^        and  w  =  e  3.7 

where  e  and  e  have  the  p.d.f.  given  in  equation  3.6.   The  Jacobian 

y    ^ 

of  transformation  in  equation  3.7  is  unity.   The  joint  p.d.f  of  e  and 

T     e-w    -w 
,   ^    e-2w  -e     -e 
w  IS  gLe.w)  =  e     e    .  e 

-       -w(l+e  ) 
e-2w    -e  *■    ^ 
=  e     ■  e         -00  <  e  <  00 

-  00  <  w  <  oo 


— w 
substituting  s  =  e  o<s<oo 


g-(s)    -  e^  /"se-^fl^^'^ds 
0 

put  V  =   s(l+e^)   whereby 


dv       .      e 
1+e 


e 
e  t  -V 


^  iUe^y       0 


-e  To  -e 


2 


from  which 

PC£^  e*)    =  —^  3.8 

1+e^ 

Substituting    e*  =    e     -    e 
^  y         X 

V 

X 

P      (x,y)    =  -n-^ Tj-  3.9 

xy  Vx         Vy 

V 
Defining  e  as  the  explicit  form  for  the  positive-valued  U  in 

Luce's  choice  model,  it  is  seen  that  the  logit  model  is  consistent  with 


27 


Luce's  ratio  scale  and  Marschak's  strict  utility  function  [87]. 

The  binary  logit  model  can  be  extended  to  the  multinomial  logit 
model  [13,  16]  in  the  multiple  choice  situation  as: 

V.   J   V. 
P^(i)  =  e  ^/  I     e  ^  3.10 

^  j=l 

where  P,(i)  is  the  probability  of  choosing  alternative  i  out  of  a 
choice  set  of  dimension  J. 

The  model  has  been  referred  to  as  an  explicit  form  of  the  strict 
utility  function  in  the  multiple  choice  setting  [87] .   The  assumption 
of  a  reciprocal  exponential  distribution  for  the  random  component  e.  can 
be  shown  to  be  equivalent  to  the  axiom  of  independence  from  irrelevant 
alternatives  implied  in  Luce's  choice  axiom. 

Thus  the  odds  of  choosing  alternative  i  over  alternative  k  are 
independent  of  other  alternatives.   It  is  noted  that 

J^   ■'        e     /D     _         1    k 


=  e 


-^      e  Vd 

J   V. 
where  D  =  ^  e   . 

The  multinomial  logit  follows  as: 

%J 

Estimating  the  Conditional  Logit  Model 

This  section  explores  the  implications  of  the  selected  linear 
utility  function  in  a  conditional  logit  model  from  the  point  of  view 
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of  statistical  inference  on  the  parameters  of  the  linear  utility 
function  estimated  by  the  method  of  maximum  likelihood.  Since  several 
references  will  be  made  to  a  paper  by  McFadden  [36]  a  notation  compa- 
tible with  that  in  this  paper  will  be  adopted. 

A  is  defined  as  the  set  of  alternatives  and  J  the  number  of 
alternative  choices  for  case  t  where  t  =  1,  2,..,,T. 

The  linear  utility  function  V.  in  the  previous  section  has  an 

equivalent  here,  denoted  by  b(i)  =  B(X.,9)  where  X.  is  a  vector  of 

variables  (of  dimension  K)  contributing  to  the  utility  function  for 

alternative  i  e  A  .   K  is  the  number  of  variables  and  9  is  a  vector  of 

unknown  parameters  defining  b(i)  =  X'. 6. 

For  i  e  A^,  t  =  1, . . , ,T  we  define  a  variable  f.  such  that  f.  =  1 
t  1  1 

if  alternative  i  is  chosen  and  f.  =  0  otherwise.   Thus 

1 

I       f-  =  1- 
i=A^ 

Using  the  logit  model,  the  probability  that  alternative  i  is 

chosen  is  given  by 

-i-      xle 

P  =  p(i:A  )  =  — 5 r-=e~^VD  3.12 

X.6 

x!e. 


where  D^  =  7   e  ■' 
t   •  , 
jeA^ 


The  likelihood  of  a  given  sample  is  given  by 

T  f . 

L  =  n   n   P(i:A  )  ^ 
5   t=l  ieA 
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T         f. 

=  n  n   p^  3.13 

t=l  ieA^ 

Let  the  log  of  the  likelihood  function  be  logCL  )  =  1(9) 

T 
L(0)  =     I       I       f.    log  P  3.14 

t=l  ieA   ^      ^   • 

Z!.e 

One  can  define  a  vector  Z..  =  X.  -  X.  in  which  case  P.  =  f  Y  e~-'^~l 

-Jl    ~3         ~i  1    ^.f;^       J 


I        Z!.6 
jeA  Z   e-^"~ 

a  log  P./9e  = zSy— 


jeA^ 


u\ 


=  y    p.  z.. 


=  [X,   I   P   -   I   P   X  ] 

=  [Xi  -  X^]  3.15 


where  X^  =  J       P.X 

t       -A       J~t 

JeA^  ■' 


a'log  P,     3 


aee      ae   "  jeA. 


r  [X.  -  y  p.  X.] 


99   jeA^ 


-  I       X  P  (9  log  P  /se') 


Since  P.  9  log  P./99'  =  8P./8e'. 
J        J   ~      J   ~ 

It  then  follows  that 
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3^1og  P. 

j — -=  -  I     x.p.cx!  -  xn 

aee     UK    -^  ■^ 

=  -  y  x.p.x!  +  c  y    p.xjx' 

=  -i   I       X.P.X!)  +  X^X' 

=  -  ^   (X  -  X  )P  ex.  -  X  )•.      3.16 

using  the  fact  that  I       P.  =  1.  Differentiating  equn.  (3.14)  and 

i  eA   "' 
using  (3.15)  and  (3.16)^ 

t=l  leA 


T  T 

t=l  ieA^  ^  ^   t=l  ^ 


3.17 


using  I       f^  =  1 
ieA^ 

2      T 

and        -r^T^r  =  -  1      1       (X.  -  X,)  P.(X.  -  X.)'         3.18 
^?^§'    t=l  jeA^  -^    '^   ^    -^        ~J 

The  maximum  likelihood  method  is  a  way  of  estimating  the  parameter 

vector  9.  This  is  accomplished  by  maximizing  L,  which  if  it  exists 

2 
must  require  9  L/3e89'  to  be  negative  definite  [36,89],  McFadden  [36] 

discussed  conditions  for  such  negative  definiteness,  as  well  as 
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necessary  and  sufficient  conditions  for  the  existence  of  a  vector 
maximizing  L(Q) . 

The  above  development  in  connection  with  the  multinomial  logit 
model  has  a  built-in  constraint:   J       P.  =  1.   For  this  reason  the 

parameters  of  a  general  b(i)  =  6  +  X.  ^e-*" . . .+X.,  9,  cannot  be 
estimated  uniquely,  the  dimension  of  the  estimation  space  being  reduced 
by  the  number  of  constraints.   This  is  noticed  by  considering  a  general 

linear  form  of  b(i)  in  equation  (3.18)  where  the  first  row  of  X.  is  unity 

2 
in  which  case  8  L/96e'  will  have  a  zero  first  row,  making  it  singular. 

If  the  constant  term  in  b(i)  is  not  zero  then  all  parameters  estimated 
are  confounded  with  the  effect  of  6.,  the  constant  term.  Another  way 
of  viewing  this  problem  is  that  the  response  probabilities  determined 
with  the  logit  model  are  invariant  under  change  of  location  of  the  logits. 
The  parameter  corresponding  to  the  general  mean  (constant  term  in  b(i)) 
is  indeterminate.   There  are  very  serious  implications  in  this  fact  in 
the  use  of  logit  model  parameters  for  computing  quantities  like  elas- 
ticities and  the  value  of  travel  time. 

Proofs  about  some  statistical  properties  of  the  conditional  logit 
model  require  that  X.  have  a  continuous  non-degenerate  distribution. 
It  is  possible  to  establish  such  proofs  [86,  89,  90]  for  categorical 
variables  by  a  suitable  reparameterization  of  the  model  so  that  the  rows 
of  (X,,  X  ,  ...,X_,)  are  linearly  independent.  A  usual  procedure  for  such 
reparameterization  is  to  exclude  one  category  of  the  categorical 
variables  so  that  unique  estimates  of  categorical  parameters  are  no 
longer  available  even  if  the  constant  term,  0,,  in  b(i)  is  zero  [86, 
90] .   In  a  situation  where  6,  is  not  necessarily  zero  and  there  is  a 
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mixture  of  categorical  and  continuous  variables  in  X.  the  parameters 
are  confounded  with  the  effects  of  both  the  mean  and  excluded  category- 
effects.   For  example,  defining  X..  as  the  ith.  component  of  X.,  if 
X.-  is  continuous  and  X._  and  X..  are  two  categories  of  a  3-category 
variable  with  category  X   excluded,  the  parameters  estimated  will  be 
(Qj  +  62  +  e^),  (e^  +  ^3  •*■  ^5)  a"«l  i^i   ""  ®4  ■"  ^5^-  Hypotheses  about 
individual  parameters  are  thus  not  testable.   Suitable  testable  contrasts 
can  be  developed.  An  example  is  (9  +  9j  +  9c)  -  (9i  +  9  +  6^)  = 
9  -  0_.   Elasticities  and  other  values  can  be  computed  from  the  esti- 
mated parameters  only  through  suitable  functions  of  estimable  functions 
[92,93].   By  introducing  time  and  cost  variables  in  ranges,  the  effects 
of  time  and  cost  over  different  ranges  can  be  tested  for  equality.   It 
is  then  possible  to  determine  unconfounded  values  of  travel  time  over 
intervals  where  time  and  cost  effects  are  constant.  The  reference  to 
the  value  of  travel  time  is  only  a  result  of  the  wide  use  of  logit 
models  in  mode  choice  analysis. 

That  the  previous  discussion  can  help  clarify  the  way  the  estimated 
parameters  of  the  logit  model  can  be  used  is  not  sufficient  if  such  a 
model  is  going  to  be  a  source  of  policy  making  in  the  transportation 
area.   Proper  statistical  inference  that  can  be  made  about  the  parameters 
of  the  model  depends  on  the  statistical  properties  of  large  sample  of 
maximum  likelihood  estimators.   Though  extensive  sensitivity  analyses 
would  be  needed  to  make  conclusive  statements,  care  must  be  taken  for 
sample  sizes  used  to  allow  such  desirable  properties  of  maximum  likeli- 
hood estimators  to  be  used. 
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As  discussed  in  Chapter  5,  parameters  under  maximum  likelihood 
estimation  have  a  joint  distribution  which  is  asymptotically  multivari- 
ate normal  in  large  samples.   Table  3.1,  shows  the  minimum  total  sample 
size  in  the  smaller  class  required  to  use  the  normal  approximation  in 
sampling  for  the  proportion  in  the  smaller  class,  p,  in  a  binary 
situation.   In  a  typical  mode  choice  problem  the  transit  users  fall  in 
the  smaller  class  whilst  auto  users  are  in  the  larger  class.   In 
several  cities,  p,  hardly  exceeds  0.1  in  which  case  a  reasonable  sample 
size  should  be  around  600.   Considering 


Table  3.1  MINIMUM  TOTAL  SAMPLE  SIZE  TO  VALIDATE  NORMAL  APPROXIMATION 
IN  THE  BINOMIAL  SAMPLING  FOR  PROPORTIONS 
(Adapted  from  Cochran  [64]) 


Proportion  in  Smaller  of  Two  Classes     Minimum  Total  Sample  Size 

0.5  30 

0.4  50 

0.3  80 

0.2  200 

0.1  600 

0.05  1400 


that  disaggregate  mode  choice  logit  models  estimate  several  parameters 
that  determine  the  probability  of  each  individual  belonging  to  either 
class,  and  the  intractability  of  small  sample  maximum  likelihood  esti- 
mation where  there  are  more  than  two  classes  and  where  one  is  interested 
in  estimating  models  for  different  market  segments,  larger  (but  not 
excessive)  sample  sizes  would  be  required.  How  to  use  a  larger  sample 
in  disaggregate  analysis  of  urban  travel  behavior  efficiently  in  the 
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sense  of  a  well  designed  sample  survey,  without  necessarily  collecting 
as  much  data  as  in  previous  transportation  studies  is  discussed  in 
detail  in  Chapter  5. 

Some  Fundamental  Assumptions  in  Estimating  the  Conditional  Logit  Model. 

In  his  exposition  on  the  statistical  properties  of  the  conditional 

logit  estimator,  McFadden  [36]  defined  the  log  of  the  likelihood 

T 
function  (see  equation  3.14)  as  L(T, e)  =  I  ^(t,e)  where 

t=i 


X(t,e)  =  I   f.  log  P     t  =  1,...,T 


Defining  \     =   8A/3e  and  AOB'  =  a^A/3eo' 


EXgCt.e")  =  I      (3P./39)e°  =  0  3.19 

o 
L(T,6)  is  maximized  at  9  . 


LCT,e)  is  maximized  for  3L(T,6)/30 [9°  =0  ,  3.20 


The  expected  value  of  A   '  is  defined  as 


o  =  _  y   P.(3^Log  P./3e9')  3.21 


E   (n  )  is  defined  as  ^^    (J  ) 
~t  -    t 


and  a  matrix  Q   is  defined  as 

^  =     I     V  fi  (J)  3.22 

J=2  '^"'^ 

where  —  0.       is  the  asymptotic  covariance  matrix  of  9  and  v  is  the 

relative  frequency  with  which  J  =  J  is  observed. 
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Equation  (3.22)  implies  that  each  observation  with  J  alternatives 
is  necessarily  drawn  from  the  same  subsample  for  the  purpose  of  pooling 
to  estimate  Q.     Just  as  it  is  untenable  to  expect  individuals  or  cases 
•  to  have  the  same  set  of  physical  spatial  alternatives  [16,17]  it  is 
unreasonable  to  imply  in  such  pooling  that  individuals  with  the  same 
number  of  physical  alternatives  belong  to  the  same  subsample.   If  one 
is  willing  to  consider  each  individual  case  to  have  a  different  set  of 
alternatives  one  should  be  prepared  to  statistically  let  several 

observations  for  the  individual  constitute  a  subsample  in  which  case 

1  '^ 
n  is  estimated  by  —  1     Q   .     The  estimate  for  fi  in  equation  (3.22), 

»  though  statistically  invalid,  would  only  serve  the  purpose  of  an  infor- 
mation matrix  in  maximum  likelihood  estimation  (89) .   Its  use  for 
statistical  inference  would  be  incorrect.   It  is  more  parsimonious,  for 
example,  in  destination  choice  modelling,  to  consider  clusters  of 
households  with  the  same  set  of  physical  alternatives  so  one  can  define 
fi^(l)  for  the  1th  cluster  location  which  will  go  into  pooling  the 
estimates  of  fi  from  different  clusters.  This  amounts  to  the  analysis 
of  market  segments  where  the  segmentation  is  geographical.  While  this 
is  theoretically  possible,  there  are  so  many  physical  destinations  and 
so  many  clusters  of  households  that  can  have  the  "same"  constrained  set 
of  physical  choice  sets  that  one  can  quickly  run  into  operational 
problems.   In  the  context  of  functional  destinations  discussed  in 
Chapters  2  and  5,  a  conscious  attempt  is  made  to  reduce  such  unnecessary 
complexity  in  dealing  with  physical  alternatives  directly. 

The  arbitrary  selection  of  unchosen  alternatives  in  situations 
where  information  on  such  alternatives  is  not  available  could  cause 
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further  problems  in  the  form  of  judgemental  bias.   If  it  is  possible, 
repeated  observations  for  each  individual  should  go  into  the  estimation 
of  ^  I      this  is  virtually  impossible  unless  physical  experimental  condi- 
tions prevail.   The  generalized  alternative  set  modelling  approach 
without  repeated  measurements  on  individuals  is  invalid  statistically 
if  one  is  not  prepared  to  accept  all  the  underlying  assumptions  in  the 
estimation  procedure  as  discussed  above.   It  is  only  under  repeated 
observations  under  which  an  individual  chooses  among  different  alterna- 
tives with  some  relative  frequency  distribution  that  one  can  develop  a 
model  for  predicting  which  one  alternative  the  individual  would  select 
under  a  given  situation.   Since  it  is  impossible  to  do  this  for  each 
individual  and  it  is  obviously  not  what  was  intended  in  the  proposition 
of  the  generalized  choice  set  approach*  there  is  a  serious  need  to  revise 
the  whole  analytical  procedure. 

Summary 

Some  aspects  of  probabilistic  choice  theory  leading  to  the  binary 
logit  choice  model  and  its  extension  to  the  multiple  choice  setting 
are  discussed.   Some  assumptions  necessary  for  maximum  likelihood 
estimation  of  the  conditional  multinomial  logit  model,  as  developed 
by  McFadden,  and  statistical  inferences  on  the  parameters  are  also 
discussed.   It  is  also  indicated  that  although  the  theoretical  devel- 
opment behind  the  estimation  of  the  coyariance  matrix  of  the  conditional 
logit  model  parameters  assumes  the  estimate  is  pooled  over  estimates 
for  different  choice  set  sizes,  the  actual  estimate  considers  all  the 
units  as  belonging  to  the  same  group.   It  is  maintained  that  the 
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over-generalization  implied  in  using  variable  destination  alternative 
choice  sets  for  different  individuals  stifles  both  the  covariance 
matrix  estimation  procedure  and  further  complicates  any  possible  . 
analytical  aggregation  techniques. 


38 


CHAPTER  4 
EMPIRICAL  ANALYSIS  USING  SOME 
CURRENT  DISAGGREGATE  METHODOLOGY 

Introduction 

This  chapter  first  discusses  the  background,  collection  and  pre- 
paration of  the  data  base  used  for  some  empirical  analyses  in  this 
chapter,  and  later  chapters. 

The  empirical  analyses  presented  in  this  chapter  examine  the 
predictive  performance  of  the  multinomial  logit  model,  presented  in 
Chapter  3,  for  specific  shopping  destination  areas.  A  difference  be- 
tween the  models  discussed  and  others  on  shopping  destination  choices 
is  that  the  latter  have  been  connected  with  travel  mode  choice  con- 
siderations and  have  considered  alternative  destinations  as  zones  or 
districts.  The  analysis  presented  in  this  chapter  involves  automo- 
bile users  and  the  destinations  are  specifically  identifiable  combina- 
tions of  shopping  centers  and  retail  outlets.  Models  developed 
using  different  variables  and  their  combinations  are  presented.  The 
effect  of  significant  correlation  between  some  of  the  variables  or 
the  predictive  ability  of  the  resulting  models  is  also  discussed. 

A  subsample  of  the  data  sample  available  was  used  to  estimate 
the  multinomial  logit  models  discussed  in  this  cahpter.  The  objective 
ves to  conduct  a  modest  empirical  analysis  to  investigate  the  perform- 
ance of  this  disaggregate  modelling  approach.  One  useful  purpose 
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that  the  analysis  served  was   to  indicate  the  general  effectiveness 
of  the  available  variables  for  developing  alternative  models  for 
functional  shopping  destination  choice  as  presented  in  Chapter  7. 

The  Data  Preparation 

The  data  in  this  research  were  derived  from  a  home-interview 
survey  in  Indianapolis,  Indiana  organized  by  Kannel  [10] .supplemented 
with  additional  data  collection  and  preparation.  Kannel  collected 
home  interview  and  attitudinal  data  from  a  sample  of  357  single-family 
households  in  the  fall  of  1971,  the  households  having  maintained  their 
residential  location  for  a  period  of  at  least  seven  years.  The 
Indianapolis  metropolitan  area  with  a  population  of  about  744,748  in 
1970  had  conducted  a  conventional  transportation  study  during  the 
fall  of  1964.  A  five  percent  sample  of  households  was  taken  for  the 
home  interview  portion  of  the  study  from  which  a  total  of  10,532 
interview  forms  were  completed. 

A  major  objective  of  Kannel 's  research  was  to  monitor  trip 
generation  changes  between  1964  and  1971.  It  was  necessary  to  use 
households  that  had  remained  at  the  same  address  so  influences  of 
changes  in  household  location  relative  to  the  transportation  system 
would  not  distort  the  results  of  the  study.  In  order  to  minimize  the 
differences  in  travel  behaviour  that  might  be  due  to  differences  in 
life  style  of  families  living  in  different  types  of  dwelling  units > 
only  single  family  dwelling  units  were  used  in  Kannel 's  survey.  Since 
trip  generation  was  a  major  concern  in  Kannel 's  work  it  was  necessary 
to  obtain  a  good  representation  of  variables  like  family  income,  auto 
ownership  and  family  size  in  his  sample  selection.  The  list  of 
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single-faunily  dwelling  units  that  had  reported  their  income  in  1964 
totalled  4,300.  A  systematic  one  in  four  sampling  procedure  was  used 
to  obtain  a  basic  list  of  households.   Each  family  in  this  list  was 
checked  against  the  listings  in  the  1971  Indianapolis  telephone  dir- 
ectory to  determine  which  households  were  still  at  their  1964  address- 
es. The  aim  was  to  obtain  a  list  of  potential  households  that  would 
yield  about  400  completed  interviews.  Though  an  initial  sample  of 
400  households  was  selected  to  be  interviewed,  a  list  of  additional 
households  was  developed  to  replace  refusals  from  households  in  the 
original  list. 

Although  a  conscious  effort  was  made  to  represent  different  com- 
binations of  family  size,  family  income  and  auto  ownership,  the  results 
of  the  1971  survey  reported  only  three  percent  of  the  trips  being  made 
by  suburban  or  city  bus  lines.  Thus  the  sample  essentially  represents 
a  group  committed  to  the  automobile.   In  addition  to  conventional 
origin-destination  information,  attitudinal  data  was  obtained  in 
structured  and  unstructured  forms.  The  structured  responses  were  in 
the  form  of  paired  comparison  and  successive  categorical  techniques 
to  evaluate  individual  attitudes  to  various  attributes  of  modal, 
shopping  destination  and  route  choices.  The  unstructured  part  of 
the  questionnaire  obtained  data  on  trip  length  and  cost  characteris- 
tics for  the  work  trip.  The  shopping  trip  was  investigated  with 
respect  to  the  time  of  day  of  travel,  mode  of  travel,  length  of  trip 
and  reason  for  a  destination  selection  for  a  shopping  trip  made 
during  a  previous  one  week  period.  The  work  in  the  present  research 
on  shopping  destination  choice  modelling  will  be  centered  around  the 
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portion  of  Kannel's  survey  on  a  recent  shopping  trip  made  and  the 
successive  categories  data  on  factors  influencing  the  choice  of 
shopping  destinations. 

Of  the  357  completed  interviews  obtained  by  Kannel  the  principal 
respondent  was  male  in  162  households  and  female  in  195  households. 
Respondent  characteristics  that  may  help  in  interpreting  results  in 
model  development  relate  to  age,  income  and  occupation.   The  average 
age  of  the  head  of  household  was  52  years  with  a  median  education 
attainment  of  11.8  years.   The  family  income  and  occupational  cate- 
gories are  shown  in  Tables  4.1  and  4.2  respectively.  Tables  4.3  to 
4.5  present  information  on  educational  level,  age  and  family  status 
distributions. 

Supplementary  Data  Co3 lection  and  Preparation 

With  the  aid  of  publications,  maps,  aerial  photographs  and  other 
material  from  the  Indianapolis  Department  of  Metropolitan  Development, 
the  Indiana  Chamber  of  Commerce,  the  Indianapolis  Star  and  the  Indiana 
State  Highway  Department  additional  data  collection  and  preparation 
were  carried  out  to  develop  a  complete  data  set  for  this  research. 

Having  identified  the  location  and  checked  the  zonal  area  codes 
for  the  1971  sample  households,  the  shopping  destinations  of  those 
who  responded  to  a  probe  into  a  recent  shopping  trip  were  determined. 
A  land-use  map  and  addresses  of  major  shopping  centers  or  clusters 
of  major  shopping  outlets  were  used  to  identify  the  most  likely  des- 
tination using  the  description  of  the  closest  intersection  to  the 
shopping  destination  selected  by  the  respondent.  The  most  reasonable 
route  for  each  such  trip  was  identified  after  a  careful  examination 


Table  4.1.  Average  Household  Income  Distribution 
for  Survey  Sample. 
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Income  Stratifications 


1971 


Households 

Percent 

21 

5.9 

14 

3.9 

20 

5.6 

25 

7.0 

20 

'    5.6 

38 

10.7 

49 

13.7 

61 

17.1 

45 

12.6 

31 

8.7 

33 

9.2 

Less  than  $4,000 
$4,000  to  4,999 
5,000  to  5,999 
6,000  to  6,999 
7,000  to  7,999 
8,000  to  9,999 
10,000  to  11,999 
12,000  to  14,999 
15,000  to  17,999 
18,000  to  23,999 
Greater  than  24,000 


Table  4.2.  Occupation  of  Head  of  Household 
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1971 

Number 

Percent 

Professional 

43 

12.0 

Managers 

32 

9.0 

Clerical 

23 

6.5 

Salesman 

40 

11.2 

Craftsman 

75 

21.0 

Operatives 

31 

8.7 

Service  Personnel 

16 

4.5 

Retired  or  Unemployed 

68 

19.1 

Total 

357 

100.0 
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of  a  network  map  with  the  assistance  of  a  person  familiar  with  the 
Indianapolis  area.   Using  travel  speed  study  results  between  1970 
and  1972, typical  speeds  were  estimated  for  different  categories  of 
road  after  which  travel  time  determinations  were  made  to  alternative 
shopping  destinations. 

Complete  information  was  available  for  only  the  major  shopping 
areas  listed  in  publications  on  retail  outlets  in  the  Indianapolis 
area  obtained  from  the  Indianapolis  Star  and  the  Chamber  of  Commerce. 
Thirty-eight  shopping  destination  areas  were  identified,  each  consist- 
ing of  one  major  shopping  center  or  a  collection  of  shopping  centers 
and  other  retail  outlets.  Information  compiled  for  each  shopping  area 
included  the  floor  area,  the  number  of  parking  spaces,  the  number  of 
discount  stores,  the  number  of  grocery  stores  and  the  number  of  other 
retail  outlets  and  the  distance  of  the  shopping  area  from  downtown 
Indianapolis. 

Having  restricted  the  analysis  to  travel  to  major  shopping  areas 
that  are  more  likely  to  attract  major  volumes  of  primary  ui'ban  travel, 
the  sample  size  useful  for  destination  choice  modelling  was  reduced 
to  about  220-households.  Out  of  a  subsample  of  120  selected  from 
the  original  sample  of  357  households  87  were  found  to  have  all  the 
information  needed  to  develop  a  disaggregate  model  for  predicting 
specific  destination  choices  with  respect  to  travel  to  major  shopping 
areas . 

A  LARR-V  digitizer  was  used  to  specify  the  geographic 
coordinates  for  each  household  and  the  center  of  each  shopping  des- 
tination area.  The  coordinates  enabled  airline  distances  from 
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households  to  shopping  centers  and  the  bearing  of  each  household 
from  each  shopping  area  to  be  computed.   Four  directions  of  approach 
to  each  shopping  area  were  defined  as  illustrated  in  Figure  4.1. 
The  bearing  of  approach  of  each  household  to  a  shopping  area  allowed 
the  direction  of  approach  to  be  quickly  determined.   From  the  highway 
network  configuration,  the  limited  information  on  24-hour  link  volumes, 
the  number  of  arterials  of  approach  and  the  number  of  intersections 
on  different  routes,  it  was  possible  to  assign  subjective  ratings  of 
ease  of  approachability  from  each  direction  to  each  shopping  center. 
In  practice,  where  the  number  of  shopping  areas  may  be  high,  a  more 
objective  determination  of  such  an  approachability  measure  would  be 
needed. 

Three  additional  alternative  shopping  destinations  were  select- 
ed for  each  household  to  provide  four  alternatives  for  each  house- 
hold. Travel  time,  airline  distance,  the  level  of  approachability 
and  shopping  opportunity  measures  were  assembled  for  alternative  des- 
tinations for  each  household,  manually  and  with  the  aid  of  appropriate 
computer  programs.   Cost  data  were  not  collected  since  with  auto 
travel  the  travel  cost  is  highly  correlated  with  the  travel  time  and 
also  because  most  of  the  parking  reported  for  the  shopping  areas  was 
free.  The  variables  for  which  there  was  sufficient  variation  over 
the  subsample  of  87  to  allow  a  meaningful  application  of  the  multi- 
nomial logit  modelling  approach  are  given  in  the  following  table: 
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NORTH 


SOUTH 


FIGURE   4.1    DIRECTIONS     OF    APPROACH    TO    A 
SHOPPING     AREA 
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Table  4.3.   Education  of  Respondents 


1)  Eight  grade  or  less 

2)  Two  years  of  High  School 

3)  High  School  Graduate 

4)  Two  years  of  College 

5)  College  Graduate 

6)  Graduate  Work 

7)  Vocational  School 


Number 

of 

Percent  of 

Respond 

ents 

Respondents 
in  Group 

43 

12 

48 

14 

139. 

39 

59 

17 

33 

9 

22 

6 

12 

3 

Table  4.4.  Age  Distribution  of  Head  of  Household,  1971 


Less  than  35 
35  to  44 
45  to  54 
55  to  64 
65  and  over 


Number  in 

Percent  in 

Age  Group 

Ag 

e  Group 

22 

■6 

59 

17 

115 

33 

95 

27 

66 

19 

Table  4.5.   Family  Status  of  Households  in  1971 

Percent  of  Households 
in  Group  


Married  with  school  age 

children 
Married  without  school  age 

children 
Not  married 


43 

39 
18 
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Table  4.6.  Multinomial  Logit  Model  Variables 

FLA  -  Retail  Floor  Area  in  1000  sq.  ft. 

PRK  -  Niunber  of  Parking  Spaces 

TL  -  Line  Haul  Travel  Time 

TA  -  Line  Haul  Travel  Time  plus  adjustments  for  congestion  and 

delays  at  trip  origin  and  destination 
DAL  -  Airline  Distance 
AP  -  Approachability  "Rating     1.   Poor 

2.  Moderate 

3 .  Good 


Application  of  the  Multinomial  Logit  Model  in  the  Prediction  of 
Specific  Shopping  Destinations 

Objections  were  raised  in  Chapters  2  and  3  with  regard  to  opera- 
tionalizing  the  multinomial  logit  model  as  currently  applied  in 
spatial  travel  demand  analysis  with  respect  to 

1.  The  destination  choice  set  definition 

2.  A  feasible  determination  of  aggregate  predictions 

3.  The  estimation  of  and  inferences  about  model  parameters. 
It  was  indicated  in  Chapter  3  that  parameter  estimates  would 

still  be  feasible  in  spite  of  the  non-compliance  with  all  the  necess- 
ary statistical  conditions  for  proper  inferences  on  the  parameters. 
The  remainder  of  this  Chapter  will  illustrate  why  parameter  estimation 
is  possible  for  the  problem  at  hand  and  discuss  the  performance  of  the 
multinomial  logit  model  in  predicting  the  choice  of  specific  shopping 
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destination  alternatives  with  various  combinations  of  variables.  The 
empirical  investigation  will  also  serve  as  a  useful  giaide  in  the  selec- 
tion of  variables  for  use  in  the  development  of  models  for  the  predic- 
tion of  the  choice  of  functional  travel  destinations  discussed  in 
Chapter  7.  The  estimation  program  used  was  obtained  from  Moshe  Ben- 
Akiva  [17]. 

In  short,  the  objection  to  the  covariance  matrix  estimated  for 
the  parameters  raised  in  Chapter  3  amounts  to  the  fact  that  although 
the  final  estimate  is  supposed  to  be  pooled  over  estimates  for  differ- 
ent choice  set  sizes,  the  estimate  actually  computed  disregards  this 
fact  and  derives  an  estimate  that  considers  all  the  units  as  belonging 
to  the  same  group.  This,  of  course,  assumes  that  the  number  of  alter- 
natives is  a  valid  basis  for  grouping  with  respect  to  the  estimation 
of  the  covariance  matrix.  On  the  application  discussed  in  this 
chapter  it  just  happens  that  the  same  number  of  alternatives  was 
selected  for  each  household  so  the  problem  with  the  covariance  matrix 
does  not  arise. 

The  estimated  models  discussed  for  shopping  destination  area 
choice  are  based  on  the  following  relationship  in  which  the  para- 
meters  6  are  estimated. 

q 


Pj   k 

Log-i=  E  ex.  -  Xi  )e 

^i   q=l   -"q     q  ^ 
where  P.  =  predicted  probability  of  choosing  alternative  j 

p.  =  predicted  probability  of  choosing  alternative  i,  the 
actually  chosen  alternative 
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X-  =  the  qth  independent  variable  for  alternative  j 

X^   =  the  qth  independent  variable  for  the  chosen  alternative  i 

q 

9q  =  the  parameter  estimate  corresponding  to  the  qth  indepen- 
dent variable. 


Specific  Destination  Choice  Models 

Figure  4.2  shows  the  correlation  matrix  for  the  variables  assumed 
to  act  as  continuous  variables  in  the  modelling  discussed  in  this 
chapter.  It  was  anticipated  that  the  appearance  of  any  pair  of  signi- 
ficantly correlated  variables  in  the  same  model  would  have  a  confound- 
ing effect  on  the  predictive  effectiveness  of  the  model.  Table  4.7 
shows  the  number  of  times  the  multinomial  logit  model  developed  with 
each  variable  or  combination  of  variables  correctly  predicts  the 
actually  chosen  alternative.  A  summary  of  the  parameter  estimation 
results  is  given  in  Table  4.8. 

Considering  single  variables  alone,  the  number  of  correct  pre- 
dictions in  Table  4.7  ranges  from  42  for  Line  Haul  Travel  Time  (TL) 
to  25  for  Airline  Distance  (DAL)  out  of  the  total  of  87  households 
considered.  As  expected,  every  correct  prediction  made  for  travel 
time  that  allows  for  terminal  times  (TA)was  made  by  Line  Haul  Travel 
Time  (TL) ,  the  latter  making  five  more  correct  predictions.  The 
same  comparison  can  be  established  between  Floor  Area  (FLA)  and 
the  Parking  Spaces  Available  (PRK)  with  Floor  Area  making  more 
correct  predictions. 

When  pairs  of  variables  that  are  not  significantly  correlated 
were  used  very  little  or  no  improvement  in  prediction  accuracy  was 
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FLA 

TL 

TA 

DAL 

PRK 

TOT 

FLA 

— 

0.2949 

0.3071 

0.1802 

0.7335 

0.8169 

TL 

— 

09773 

0.7865 

0.0570 

0.1408 

TA 

— 

0.7643 

0.0311 

0.1287 

DAL 

— 

O0064 

O0485 

PRK 

— 

O8088 

TOT 

— 

FIGURE    4-2  CORRELATION     MATRIX    FOR     LOGIT 
MODEL    VARIABLES 
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Table  4-7     Predictivs    Accurocy    of   Conditional    Loqit     Model"'   "In  Specific    Shopping 
Destinalion   Choice  Prediction 


Household 
Number 

Set  of    Eiplonalory    Vorlobles 

TL 

TA 

FLA 

P  R  K            TOT 

DAL 

FLA, DAL 

1 

X 

X 

X 

X 

2 

X 

X 

3 

X 

X 

X 

4 

5 

X 

6 

X 

X 

X 

X 

7 

X 

8 

X 

9 

X 

10 

X 

X 

1  1 

X 

X 

X 

12 

X 

X 

X 

X 

13 

X 

X 

X 

14 

15 

X 

X 

X 

X 

X 

X 

16 

X 

X 

X 

X 

X 

17 

X 

X 

X 

X 

X 

X 

18 

X 

X 

X 

X 

X 

X 

19 

X 

X 

X 

20 

X 

X 

X 

21 

X 

X 

X 

X 

X 

X 

22 

X 

X 

X 

X 

X 

X 

23 

X 

X 

X 

X 

X 

X 

24 

25 

X 

X 

X 

26 

X 

X 

27 

X 

X 

28 

X 

X 

29 

X 

X 

X 

X 

X 

30 

X 

X 

X 

31 

X 

X 

X 

32 

X 

X 

X 

33 

X 

X 

X 

X 

X 

34 

X 

X 

X 

35 

X 

X 

X 

X 

X 

36 

X 

X 

37 

X 

X 

X 

38 

X 

X 

39 

X 

X 

40 

X 

41 

X 

X 

X 

X 

42 

X 

X 

X 

43 

X 

X 

X 

44 

X 

X 

X 

X 

The  Legend  for 
the  Variables  in 
Table  .  4-6 

> 
'X'indicoles 

Correct  Prediction 


Tobl«  4-7  Conl. 
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otol  Correcl 
Prsdiclions 


Household 
Number 

Set   of    Explonolory    Varioblej 

TL 

TA 

FLA 

PRK 

TOT 

DAL 

FLA.DAL 

45 

X 

X 

X 

X 

46 

X 

X 

X 

X 

47 

X 

X 

X 

X 

48 

X 

X 

X 

X 

49 

X 

X 

50 

X 

X 

X 

X 

X 

X 

51 

X 

X 

52 

X 

X 

53 

X 

X 

X 

X 

X 

54 

X 

X 

X 

55 

X 

X 

X 

56 

X 

X 

57 

X 

X 

58 

X 

X 

X 

59 

60 

X 

X 

61 

X 

X 

X 

62 

X 

X 

63 

64 

X 

X 

65 

X 

X 

X 

66 

67 

X 

68 

X 

69 

X 

70 

X 

X 

71 

X 

72 

X 

73 

X 

X 

74 

X 

X 

X 

^ 

75 

X 

X 

X 

X 

76 

77 

X 

X 

X 

X 

X 

X 

78 

X 

X 

79 

X 

X 

80 

X 

X 

X 

X 

X 

X 

81 

X 

82 

X 

X 

83 

X 

X 

X 

84 

X 

X 

X 

X 

X 

X 

85 

X 

X 

X 

X 

X 

X 

86 

X 

X 

87 

X 

X 

X 

X 

X 

X 

42 

37 

40 

35 

33 

25 

41 

The  Leoond  for 
tho  Varioblej  in 
Toble     4-6 

'X'  lndicol«» 
CorracI   Pfediclion 


Toble  4-7  Cont. 
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Household 
Number 

Set    of    ExDianclory    Vorioblej 

FLA,AP3 

TL.PRK 

TL,AP3 

TA.PRK 

TA.TOT 

TA,AP3 

1 

X 

2 

X 

X 

X 

X 

X 

3 

X 

X 

X 

X 

X 

4 

5 

6 

X 

X 

X 

7 

e 

9 

X 

10 

II 

X 

X 

X 

X 

X 

12 

X 

X 

13 

X 

14 

15 

X 

X 

X 

X 

X 

X 

16 

X 

17 

X 

X 

X 

X 

X 

X 

18 

X 

X 

X 

X 

X 

X 

19 

X 

X 

X 

X 

X 

X 

20 

X 

21 

X 

X 

X 

X 

X 

22 

X 

X 

X 

X 

X 

X 

23 

X 

X 

X 

X 

X 

X 

24 

X 

25 

X 

26 

X 

27 

X 

X 

28 

X 

X 

29 

X 

X 

X 

X 

X 

30 

X 

X 

X 

X 

31 

X 

X 

X 

32 

X 

X 

X 

33 

X 

X 

X 

X 

X 

X 

34 

X 

X 

35 

X 

X 

X 

X 

X 

X 

36 

37 

X 

X 

38 

X 

X 

39 

40 

X 

X 

X 

X 

X 

41 

X 

42 

43 

X 

44 

X 

X 

X 

The  Leqend  for 
Iho  Voriobles  In 
Tobla     4-6 

X  Indicates 
Correct  Prediction 
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Total  Correct 
Predictions 


Housettold 
Number 

Set    of    Explanatory    Voriables 

FLA.AP3 

TL.PRK 

TL.APJ 

TA.PPK 

TA.TOT 

TA,AP3 

45 

X 

X 

46 

X 

47 

X 

X 

X 

48 

X 

X 

X 

49 

X 

X 

50 

X 

X 

X 

X 

X 

X 

51 

X 

X 

52 

53 

X 

X 

X 

X 

X 

X 

54 

X 

55 

X 

56 

X 

X 

57 

X 

X 

X 

58 

X 

X 

X 

X 

59 

60 

X 

X 

X 

X 

61 

X 

X 

X 

X 

62 

X 

X 

63 

64 

X 

X 

65 

X 

X 

X 

X 

X 

66 

67 

X 

68 

X 

X 

X 

X 

X 

69 

X 

70 

X 

X 

X 

X 

71 

72 

73 

X 

X 

74 

X 

X 

75 

X 

X 

X 

76 

77 

X 

X 

X 

X 

X 

X 

78 

X 

X 

79 

X 

X 

80 

X 

X 

X 

X 

X 

X 

81 

X 

X 

X 

X 

62 

X 

X 

83 

X 

X 

X 

X 

X 

X 

84 

X 

X 

X 

X 

X 

X 

65 

X 

X 

X 

X 

X 

X 

86 

X 

X 

X 

87 

X 

X 

X 

X 

X 

X 

43 

38 

41 

39 

36 

40 

The  Legend  for 
the  Variables  In 
Table    4-6 

'X' Indicates 
Correct   Prediction 
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Household 
Number 

Set  of    Etplnnatory  Voricbles                                  | 

PRK.DAL 

TOT, DAL  i7  0T,flP3 

PRK,AP3 

TL.FLA 

TL,n.A,AP3 

1 

X 

X 

X 

2 

X 

X 

X 

3 

X 

X 

X 

4 

5 

6 

X 

X 

X 

X 

7 

8 

X 

X 

9 

10 

II 

X 

X 

X 

12 

X 

X 

X 

13 

14 

15 

X 

X 

X 

X 

X 

X 

16 

X 

X 

X 

X 

17 

X 

X 

X 

X 

X 

X 

18 

X 

X 

X 

X 

X 

X 

19 

X 

X 

X 

X 

X 

X 

20 

X 

X 

X 

X 

21 

X 

X 

X 

X 

X 

X 

22 

X 

X 

X 

X 

X 

X 

23 

X 

X 

X 

X 

X 

X 

24 

25 

26 

27 

28 

29 

X 

X 

X 

X 

30 

X 

X 

X 

X 

31 

X 

X 

X 

X 

32 

X 

X 

X 

X 

33 

X 

X 

X 

X 

34 

X 

X 

X 

X 

35 

X 

X 

X 

X 

36 

X 

X 

37 

X 

X 

X 

X 

38 

39 

X 

X 

40 

X 

X 

X 

41 

X 

X 

X 

42 

43 

44 

X 

X 

X 

X 

X 

The  Legend  for 
Iho  Voriobles  in 
Tobis     46 

X  Indicate) 
Correct   Prediction 


Tobl«  4-7  Cont 
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Household 
Number 

Set    of    Explonoto 

Y    Voriobles 

PRK.DAL 

TOT,  DAL 

T0T,AP3 

PRK,AP3 

TL.FLA 

TLPJ^APZ 

45 

X 

X 

X 

46 

X 

X 

X 

47 

X 

X 

X 

X 

X 

48 

X 

X 

X 

X 

X 

49 

50 

X 

X 

X 

X 

X 

X 

51 

52 

53 

X 

X 

X 

X 

X 

X 

54 

X 

55 

X 

X 

56 

57 

X 

X 

X 

X 

X 

X 

58 

X 

X 

X 

X 

X 

59 

60 

X 

X 

61 

X 

X 

X 

X 

X 

62 

63 

64 

65 

X 

X 

X 

X 

X 

66 

67 

68 

X 

X 

X 

X 

X 

69 

70 

X 

X 

X 

71 

72 

73 

74 

X 

X 

75 

X 

X 

X 

X 

X 

X 

76 

77 

X 

X 

X 

X 

X 

X 

78 

X 

79 

80 

X 

X 

X 

X 

X 

X 

81 

X 

X 

82 

83 

X 

X 

X 

X 

X 

84 

X 

X 

X 

X 

X 

X 

85 

X 

X 

X 

X 

X 

X 

86 

X 

X 

X 

X 

X 

X 

87 

X 

X 

X 

X 

X 

X 

Totol  Correct 
Prodictions 

35 

39 

37 

36 

43 

42 

The  Legend  for 
the  Voriobles  in 
Tobia     4-6 

'X' Indicates 
Correct    Prediction 
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achieved,  as  shown  in  Table  4.7.  Considering  the  correct  predictions 
made  by  single  variables,  one  gets  the  impression  that  if  Line  Haul 
Travel  Time  (TL)  and  Floor  Area  (FLA)  act  independently  the  two  of 
them  in  a  single  model  should  produce  more  correct  predictions.  It 
was  anticipated  that  since  the  correlation  between  these  two  variables 
is  significant  such  an  optimistic  expectation  would  not  materialize. 
This  can  be  verified  in  Table  4.7.  Similar  remarks  can  be  made 
about  Line  Haul  Travel  Time  (TL)  and  Airline  Distance  (DAL)  acting 
in  the  same  model  since  these  two  variables  are  significantly  corre- 
lated and  do  not  always  make  the  same  actual  predictions  when  they 
act  separately. 

The  approachability  measures  turned  out  to  be  in  categories  2 
or  3  (i.e.  medium  or  good  approachability)  because  no  household  in 
the  subsample  taken  had  a  rating  of  1  (poor  approachability)  with 
respect  to  its  approachability  to  any  of  its  alternative  destinations. 
The  models  developed  to  combine  the  approachability  measures  as 
dummy  variables  with  other  variables  do  give  the  same  results  as 
those  obtained  in  the  absence  of  the  approachability  measure.  This 
indicates  that  the  ease  of  approach  to  the  shopping  centers  did  not 
really  pose  any  serious  problems  for  the  households  concerned.  This 
confirms  the  fact  that  no  particular  effort  was  reported  to  have  been 
made  by  the  households  to  avoid  congestion  by  making  shopping  trips 
during  off-peak  periods .  Another  contributing  factor  is  that  the 
subsample  covered  households  from  some  two  or  three  adjacent  sectors 
which  probably  resulted  in  the  households  involved  being  relatively 
located  with  respect  to  the  highway  network  in  a  similar  manner. 
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Table  4.8.  Sunmary  of  Conditional  Logit  Model  Estimation  Results 


Explanatory  -2  x  Log  Like- 

lihood 
Variables^   Parameter  Estimate^        Likelihood      Ratio 


TL 

-0.0607  (0.0239 

-117.22 

6.787 

TA 

-0.0291  (0.0233) 

-119.81 

1.5945 

FLA 

0.00165  (0.000414^ 

-111.98 

1.726 

PRK 

0.0001788  (0.0000593) 

-115.90 

9.415 

TOT 

0.0145  (0.00413) 

-114.28 

12.664 

DAL 

0.0278  (0.0372) 

-120.33 

0.547 

FLA 
DAL 

0.00164  (0.000418) 
0.00538  (0.0410) 

-111.97 

17.28 

FLA 
AP3 

0.00152  (0.000418) 
-0.628  (0.408) 

-110.66 

19.89 

TL 
PRK 

-0.0686  (0.0241) 
0.000199  (0.0000606) 

-111.58 

18.055 

TL 

AP3 

-0.0614  (0.0235) 
-0.872  (0.391) 

-114.26 

12.70 

TA 
PRK 

-0.0393  (0.0239) 
0.000192  (0.0000604) 

-114.51 

12.20 

TA 
TOT 

-0.0381  (0.0241) 
0.0152  (0.00417) 

-112.99 

15.23 

TA 
AP3 

-0.03207  (0.023) 
-0.872  (0.392) 

-116.86 

7.492 

*The  Legend  for  the  Explanatory  Variables  is  on  Table  4.6 
Standard  Deviation  of  Parameter  in  Parenthesis 
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Table  4.8,   Cont. 


Explanatory  -2  x  Log  Like- 

lihood 
Variables^   Parameter  Estimate"        Likelihood      Ratio 


PRK 

0.000176  (0.0000598) 

-120.61 

9.537 

DAL 

0.0139  (0.0397) 

TOT 

0.0146  (0.00415) 

-114.02 

13.168 

DAL 

0.0285  (0.0397) 

TOT 

0.0146  (0.00417) 

-111.51 

18.185 

AP3 

-0.875  (0.403) 

PRK 

0.000158  (0.0000585) 

-114.03 

13.149 

AP3 

-0.726  (0.401) 

"* 

TL 

-0.0844  (0.0251) 

-105.79 

29.627 

FLA 

0.00197  (0.000437) 

FLA       0.00182  (0.000449) 

TL        -0.0798  (0.0249)  -105.02        31.168 

AP3 -0.489  (0.410) 

^The  Legend  for  the  Explanatory  Variables  is  on  Table  4.6 
^Standard  Deviation  of  Parameter  is  in  Parenthesis 
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Models  that  contain  three  or  more  variables  from  the  set  available 
did  not  make  any  difference,  as  expected,  for  reasons  given  above. 

It  has  been  argued  [16]  that  since  actual  human  shopping  behav- 
iour may  be  random  one  would  expect  one  out  of  four  correct  predic- 
tions to  be  made  on  the  average,  considering  an  average  of  four 
alternatives  for  each  household.  For  all  the  models  discussed  in 
this  chapter,  the  percentage  correct  prediction  is  more  than  25%  but 
less  than  50%.   In  a  previous  use  of  the  multinomial  logit  model 
by  Charles  River  Associates  [16]  on  zonal  shopping  destinations  in 
a  mode  choice-flavored  context,  a  40%  correct  prediction  of  actual 
destinations  was  achieved.   It  is  difficult  to  come  up  with  predic- 
tions that  can  be  effectively  aggregated  if  there  is  no  provision  to 
account  for  50%  of  the  sample  in  the  choice  of  actual  destinations. 
If  actual  destination  choice  was  really  random  it  would  be  better  to 
randomly  assign  destinations  in  the  first  place.  Random  choice  is 
only  possible  if  all  alternatives  are  equally  likely.  It  is  known 
that  shopping  destination  choice  is  often  based  on  characteristics 
of  the  shopping  areas  which  are  not  the  same  for  all  alternatives. 
This  point  is  discussed  in  connection  with  the  attitudinal  work  on 
shopping  destination  choice  presented  in  Chapter  6.  The  impractical- 
ity  of  defining  destination  choice  sets  of  varying  size  and  the  lack 
of  a  provision  for  aggregation  in  the  use  of  the  conditional  logit 
model  have  been  discussed  in  Chapters  2  and  3.  An  additional  factor 
that  lessens  the  usefulness  of  the  logit  model  for  directly  predict- 
ing actual  destination  choices  is  its  low  predictive  accuracy  as 
discussed  in  this  chapter. 
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An  alternative  modelling  procedure  presented  theoretically  in 
Chapter  5  with  example  application  in  Chapter  7  predicts  functional 
classes  of  destination  alternatives.   If  further  variables  are  avail- 
able one  can  go  to  the  stage  of  using  a  choice  model  to  predict  actual 
choices  within  functional  alternatives.   It  is  pointed  out  in  Chapter 
5  that  at  this  second  stage  simulation  is  considered  the  most  practi- 
cal way  of  handling  the  problem  given  the  present  quality  of  data 
available  on  spatial  choices  and  the  expectation  that  alternatives 
within  functional  destination  categories  are  more  equally  likely. 
The  alternative  procedure  will  use  ranges  of  variables  as  categories 
and  also  account  for  some  of  the  interactions,  the  presence  of  which 
forfeited  improvements  that  could  have  been  achieved  in  predictabili- 
ty. . 

Summary 

The  multinomial  logit  model,  increasingly  becoming  a  tool  in 
disaggregate  modelling  of  urban  travel  demand,  was  found  to 
yield  low  correct  prediction  levels  of  actual  shopping  destination 
choices.  A  38  to  49  percent  range  of  correct  predictions  was 
found  regardless  of  whether  one  key  variable  or  a  combination  of 
variables  is  used  as  an  explanatory  set  of  variables.   Some  of  the 
key  variables  usedwere  Line  Haul  Travel  Time,  Retail  Floor  Area  and 
the  Parking  Spaces  available.  Ordinarily,  one  would  have  expected 
the  models  with  independently  acting  variables  to  do  better.   This 
was  not  the  case  mainly  because  some  of  the  variables  (eg.  approach- 
ability)  really  had  little  effect  on  destination  choice  by  virtue  of 
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the  background  of  the  sample  used.  Significant  correlation  between 
variables  was  found  to  reduce  the  effectiveness  of  models. 

It  is  emphasized  out  that  low  predictive  accuracy  is  not  the 
only  serious  problem  with  the  generalized  multinomial  logit  procedure 
discussed  in  Chapter  3.  Other  shortcomings  include  destination  choice 
set  definition  and  aggregation  as  discussed  in  Chapters  2  and  3. 
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CHAPTER  5 

DESTINATION  CHOICE  MODELLING  WITH  PRIOR 

CONSIDERATION  OF  A  SAMPLING  FRAMEWORK  FOR 

AGGREGATE  PREDICTION 

The  formulation  and  estimation  of  parameters  for  the  multinomial 
response  relation  model  are  first  introduced  in  this  chapter.  The 
adaption  of  the  multinomial  response  approach  to  the  analysis  of  the 
responses  of  market  segments  to  functional  travel  behavior  alternatives, 
after  the  definition  and  the  statistical  implications  of  the  classifi- 
cation involved  have  been  discussed,  is  provided.   It  is  maintained 
that  such  a  use  of  the  multinomial  response  approach  can  foster  aggregate 
prediction  since  market  segmentation  according  to  socio-economic  and 
population  characteristics  from  readily  available  sources  can  be  tied 
to  the  sample  survey  for  an  operational  transportation  study.   The  use 
of  the  multinomial  response  procedure  to  provide  estimates  of  the 
proportions  of  market  segments  that  respond  to  functional  travel 
alternatives  for  use  in  separate  sampling  logistic  discrimination  for 
disaggregate  analysis  is  discussed.   The  recently  developed  logistic 
discrimination  procedure  has  an  advantage  over  classical  procedures 
since  it  allows  the  handling  of  a  combination  of  categorical  and  contin- 
uous variables.   It  is  argued  that  separate  sampling  at  activity  center- 
market  segment  interfaces  be  adopted  for  more  efficient  sampling  for 
disaggregate  analysis. 
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Large  samples  (efficiently  collected  not  to  be  excessive)  are 
recommended  in  operational  studies  so  that  proper  statistical  inferences 
can  be  made  on  goodness  of  fit  measures  and  on  the  parameters  of 
probability  distribution  functions.  General  maximum  likelihood  esti- 
mation procedures,  the  assumptions  involved  and  the  associated 
asymptotic  distributions  of  parameter  estimators  are  extensively  dis- 
cussed to  support  the  call  for  large  sample  surveys.  [Multivariate 
statistical  testing  procedures  relevant  to  parameters  estimated  for 
probability  models  of  travel  behavior  are  presented  in  Chapter  7] 

Most  importantly  the  presentation  shows  how  recently  developed 
statistical  tools  can  allow  the  modelling  of  travel  behavior  without 
very  sophisticated  data  collection  and  measurement  scales.   There  is 
more  reliance  in  survey  design  and  systematic  structuring  of  statistical 
models. 

Multinomial  Response  Law  and  its  Estimation 

Suppose  we  have  subjects  under  different  experimental  conditions 
reacting  to  categories  of  some  response  classification.  We  let  the 
number  of  categories  in  the  response  classification  be  m  >^  2.  The 
number  of  subjects  under  condition  or  market  segment  j  (j  =  l,2,...,n) 
who  fall  in  category  h  of  the  classification  is  r.,  where 


m 
h= 

Assuming  a  random  selection  of  subjects,  the  probability  of  response 


III 

y     r.,    =  N. 
.=1     J^         J 


frequencies  r.,    is  given  by: 


n  N.!  r.,r._         r. 

Jl    D   J2        p   jm 


I     f-^n r  P.^^   P.^/...P.-""  5.1 

^,    r.,!r.^!...r.    !      jl       j2  jm 


j=l     jl"    j2  jm' 
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Suppose  we  have  experimental  variables  that  can  be  used  to  develop 

functions  defining  the  response  probabilities.  A  generalization  of  the 

binomial  logit  implicit  in  the  work  of  Luce  [39]  and  Mantel  [94]  can 

express  the  response  probabilities  in  terras  of  multivariate  logits 

Z.,,  Z. „,..., Z.   as  follows: 
jl       j2      jm 

Z 
Pjh  =  e  J"/D^  h  =  l,...,m 

where  P.,  is  the  probability  that  a  member  of  market  segment  j 
responds  to  the  hth  response  category 


■'   h=l 


t   q 


ra   Z. 


e 


jh 


'"'  'jh  =  ij  J^  ^jk  ^£  -£h 

X.,  ,  the  jkth  element  of  an  nxq  matrix  X  defined  below,  represents  the 
kth  effect  for  the  market  segment  j.   a.,  is  the  ihth  element  of  a  ^xm 
A,  discussed  below,  that  corresponds  to  the  £th  effect  of  the  hth 
response  category.   3,   is  the  parameter  depicting  the  effect  corre- 
sponding to  the  kth  market  segmentation  effect  and  the  £th  response 
categorization  effect. 

A  linear  model  expressing  the  logits  in  terms  of  the  physical 
variables  must  provide  for  a  structure  of  the  categories  in  the 
response  classification.   This  structure  may  be  a  result  of  a  crossing 
or  nesting  of  mutually  exclusive  classifications  of  the  same  subjects. 
We  define  a  model  that  is  sufficiently  general  to  include  both  a 
structured  response  classification  and  multiple  experimental  variables 
as: 
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hi        \2 


hi        h2    ■ 


\l        \l 


.    z 


Im 


■2ni 


.    z 


nm 


""ll        ^2    •'•    ^Iq 
^21        ^22    ■'■    hq 


X     ,  X     -      ...     X 

nl  n2  nq 


'  6 

11        ^1^ 

hi 


12    "•  •    -it 

^22    • ■  •    ^2t 


6  «         .         3 

qi     ^q2         qt 


a  a, ....    a. 

11  12  Im 


^21        ^22 


2m 


at,        at_    ...    a^ 
1  2  tin 


which  is   equivalent  in  matrix  notation  to 

Z  =  X   ■   B   •   A 

nxm   nxq   qxt   txra 


5.2 


Matrix  X  accounts  for  variation  in  the  response-probabilities 
across  the  experimental  conditions, X  is  usually  not  of  full  column 
rank.   If  the  rank  of  X  is  r  <_  q  it  is  necessary  to  reparameterize  the 
model  as 


^  =  K   ■   L 
nxr   rxq 


5.3 


where  the  rows  of  L  are  coefficients  of  linearly  estimable  functions  of 

X 

the  parameters  in  the  columns  of  B.  i.e.  rank  ['-]  =  rank  [X].  We  thus 

have 


K  =  XL'CLL') 


5.4 


Examples  of  the  construction  of  K  are  discussed  in  Bock  [86] .   X  can  be 
referred  to  as  the  physical  part  of  the  model .   K  is  a  column  basis  of 
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Matrix  A  in  5.2  accounts  for  the  variation  of  response  probabilities 
over  response  categories,  A  is  also  usually  of  deficient  rank.   If  rank 
(A)  is  s  _<  t  then  a  reparameterization  similar  to  5.3  is 

A  =  S   •  T  5.5 

txs   sxm 

where  rank  S  =  s  and  the  columns  of  S  are  linearly  dependent  on  those 

of  A,  we  have 

rank   [A]   =  rank   [A,S] . 

Again  we  have 

T  =    (S'S)"-^   S'A  5.6 

A  can  be  called  the  response  part  of  the  model,  for  which  T  is  a  row 
basis. 

When  X  and  A  are  of  deficient  rank,  reparameterization  of  the  model 
as  discussed  is  necessary  to  make  the  matrix  of  second  derivitives  of 
the  log  of  the  likelihood  function  non-singular  if  maximum  likelihood 
estimation  via  a  suitable  gradient  technique  is  used. 

The  reparameterized  full  rank  model  is 

Z  =  K(LBS)T  =   K   r   T  5.7 

nxr  rxs  sxm 
where 

r  =  LBS 

It  is  then  the  rxs  matrix  of  parameters,  T,   that  is  estimated.  The 
elements  of  r  in  statistical  terms  will  be  called  effects. 
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r  will  be  estimated  using  a  maximum  likelihood  estimation  procedure 
with  the  Newton-Raphson  gradient  technique.  The  log  of  the  likelihood 
function  given  in  5.1  will  be  denoted  by  L  . 

q 

The  following  are  defined 


r.  = 
-J 


"jl 


J2 


jm 


and  P. 
-J 


'i2 


jm 


5.8 


We  denote  the  elements  of  r  as  y,,.     The  rsxl  vector  of  first  derivatives 

kh 

of  L  with  respect  to  the  elements  of  r  considered  as  elements  of  an 

q 

rsxl  vector  in  which  the  column  subscript  varies  first  is  given  by 
[86] 


n 


(D  =  I       T   (r.  -  N.  p.)  (g)  K. 


i=l  sxm 


J  ~y 


5.9 


mxl 


'J 
rxl 


where  ®  denotes  a  Kronecker  product  and  K.  is  the  ith  row  of  K  written 

-J        ^         - 

as  a  column.   (A  Kronecker  product  is  defined  in  a  later  section) 
Defining  the  following  mxm  matrix: 


W. 
-J 


P.-(1-P.J  -  P., P. 7 
Jl    :i     Jl  j2 


-P. -P., 
32  jl 


-P.  P., 

jm  jl 


p.,(i-p.,) 


-  P.  P., 
jm  j2 


-P., P. 
Jl  jm 


-P., P. 
j2  jm 


P.  (1-P.  ) 
jm    jm-^  J 


5.10 


rw.  is  the  variance-covariance  matrix  of  the  estimators  of  P.,  ] 
•■-J  jh 


The  rsxrs  matrix  of  second  derivatives  of  L  is  then: 

q 
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n 

VCr)   =  -  y  N.  TW.T'  0K.K.'  5.11 

rs  X  rs    i=l 

sxs      rxr 

For  the  likelihood  function  to  be  maximiLed  the  first  derivative  of  L 

q 

with  respect  to  y       must  be  zero,  i.e. 

9  (D  =  9  5.12 

rsxl   rsxl 

For  solutions  of  equation  (5.12)  to  be  unique  the  matrix  of  second 

derivatives  of  L  must  be  negative  definite.   Since  W.  is  the  variance- 
-q  ^  ~j 

covariance  matrix  of  the  multinomial  probabilities,  it  is  positive  semi- 
definite  with  only  one  zero  root.  The  characteristic  vector,  a, 
corresponding  to  the  zero  root  has  each  component  as  unity.   The  matrix 
TW.T'  is  therefore  positive-definite  so  long  as  s  <  m,  T  is  of  full  rank 
and  T  does  not  have  a  vector  a  as  a  row. 

The  matrix  K.K.'  is  positive-semidefinite  and  of  rank  one.   The 
Kronecker  product  of  TW.T'  and  K.K.'  can  be  established  to  be  of  rank 
s  [95].   For  n  >^  r  and  K  of  full  rank,  the  sum  over  j  of  these  Kronecker- 
products  will  be  positive  definite.   The  matrix  ^(r)>  being  the  negative 
of  a  positive  definite  matrix,  is  therefore  negative  definite  for  all 
finite  P.  A  Newton-Raphson  solution  of  the  likelihood  equations  (5.12) 
shoyld  therefore  converge  to  the  maximum  likelihood  estimates  from  any 
finite  initial  values. 

Let  r.  be  the  trial  value  at  the  ith  iteration.   From  5.7  the 
-1 


trial  logits  are 


[ZJ;^]  =  ?i  =  KJ.  5.13 

nxn 
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and  the  trial  probabilities  are 

*     -1     ih 
P.  =  d/  [  e  J"  ]  5.14 

-'     ■"•   nxm 
nxm   nxn 

where  D.  is  a  diagonal  matrix  whose  elements  are  the  sum  of  elements  in 

ih  * 

the  rows  of  [e  -^   ].   The  elements  in  the  rows  of  P.  are  the  components 

of  the  vector  P.  defined  for  equation  C5.9)  and  used  for  equation  (5.10) 

The  adjustment  vector  is 

i 

-i  "  -^"^(^i^'P  tti^  5.15 

where  6.  is  an  rsx  1  vector  whose  elements  constitute  6,.   (adjustments 
-1  -kh  ^  ■' 

to  r^,  )  where  the  subscript  h  varies  first). 

The  adjustments  are  added  to  corresponding  elements  of  r.  to  get 
improved  estimates  up  to  a  specified  tolerance  level. 

A  convenient  goodness-of-fit  test  for  the  overall  model  is  given 

by  the  likelihood  ratio  statistic  [49,  89] 

n   m 

Jk  -^e  "j  'jk'^jk 


X^  =   -2     I        I     T,^   log^  N.  P,^/r.^         5.16 
j=l  k=l 


P.,    are  as  computed  in  equation  (5.14). 

The  associated  number  of  degrees  of  freedom  is  n(m-l)  -  rs. 
'More  extensive  discussion  on  inferences  on  the  parameters  estimated 
will  be  provided  in  the  context  of  large  sample  maximum  likelihood 
estimators  in  a  later  section.  Hypothesis  testing  with  the  multinomial 
response  model  will  be  discussed  during  its  application  in  Chapter  7. 
The  following  section  relates  the  multinomial  response  model  to  desti- 
nation choice  modelling  for  different  market  segments,  for  which  the 
empirical  analysis  in  Chapter  7  will  be  carried  out. 


72 


A  Specific  Applied  Discussion  of  the  Multinomial  Response  Relation 
Procedure 

In  the  discussion  following  equation  5.1  some  multivariate  logits, 
Z.,  ,  were  introduced  whose  elements  form  the  matrix  Z  given  in  equation 
5.2  as 

Z  =  X  •   g   •  A      "  5.2 

nxm   nxg   gxt   txm 

For  expediency  in  the  ensuing  theoretical  development  matrices 
X,  B,  and  A  were  discussed  in  briefly  general  terms.   Starting  from 
equation  5.2,  a  presentation  is  given  of  the  application  of  the  multi- 
nomial response  relation  technique  to  the  special  kind  of  problems  that 
will  be  of  interest  in  this  research. 

A  simple  case  where  all  the  matrices  involved  in  equation  5.2  are 
2x2  matrices  is  considered  for  which 

pii  ^i2i  r^ii  '12]  pii  ^12]  pii  ^2! 

Lz^i        Z^J      Lx^i        x^J       Le^i        322-1         Ul        ^22-1 

The  above  expression  implies: 


^11  "  ""ll  ^11  ^11  ^  ""12  ^21  ^1  *  ""ll  ^12  ^21  *  ""n  ^22  ^21 

^12  =  ^1  ^11  ^2  ""  ''12  ^21  ^12  *  ''ll  ^12  ^22  ^  '^12  ^22  ^22 

^21  "  ""21  ^11  ^11  ^  "^22  ^21  ^1  *   ""21  ^12  ^21  ^  '^22  ^22  ^21 

^22  "  ^21  ^11  ^12  ■"  ^22  ^21  ^22  "^  ^21  ^12  ^22  "^  ^22  ^22  ^22 

We  can  therefore  deduce  the  expression  for  the  multivariate  logit  Z., 
as 
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t   q 

^jh  =   ^,  X    ""jk  \l  ^Hh  '^^  1.2. ..,.m 

■'  =1  k=l  ■' 

j  =  1,2, ...,n 


Z.,  can  also  be  expressed  as 


-i  -  ~h 
Ixq  qxt  txl 


i.e.  each  multivariate  logit  Z..  corresponding  to  the  jth  market 
segment  and  ith  response  category  can  be  written  as  the  product  of  row 
j  of  X  (x.),  the  parameter  matrix  P  and  column  h  of  A  (a,).   Each 
element  3  ^  of   is  the  parameter  corresponding  to  the  kth  column  of  X 
and  the  ith   row  of  A  which  are  described  below. 

Each  column  of  X  represents  an  effect  for  the  design  model  that 
it  depicts  and  the  same  applies  for  each  row  of  A.  An  element  of  the 
row  vector  x.  is  0  or  1  according  as  the  effect  corresponding  to  its 
column  position  is  absent  or  present,   the  iith  row  element  of  the 
column  vector  a,  is  0  or  1  according  as  the  effect  associated  with  the 
th  row  is  absent  or  present. 

The  matrices  X  and  A  can  each  represent  a  crossed  or  nested  design. 
In  the  problems  discussed  in  this  study  we  consider  our  market  segments 
separately  in  which  case  we  need  not  be  concerned  with  representing 
any  design  with  X.   The  A  matrices  in  the  applications  in  this  research 
are  all  for  crossed  designs.  We  shall,  however,  discuss  a  hypothetical 
example  in  which  each  of  the  X  and  A  m.atrices  represents  a  crossed 
design. 

The  matrix  X  depicts  the  experimental  design  configuration  for  a 
market  segmentation  process  whilst  the  matrix  A  represents  the 
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corresponding  configuration  for  the  response  classification.  An 
example  of  factors  w  and  S  corresponding  to  attitudinal  descriptors 
"Wide  selection  of  Goods"  and  "Shortest  Travel  Time"  respectively. 
Suppose  there  are  2  categories  for  WCW. ,  W^)  and  3  classes  for  S 
(S^ ,  S-,  S_)  where  the  higher  the  category  the  more  important  the 
descriptor  is  to  a  respondent  in  an  attitudinal  survey.  A  basic  2x3 
classification  for  the  definition  of  X  is: 


=1 

^2 

^3 

"l 

1 

2 

3 

"2 

4 

5 

6 

If  the  factors  W  and  S  act  independently,  an  entry  in  cell  j 
(j  =  1,...,6)  of  the  above  table  is  depicted  by  the  main  effects 
corresponding  to  its  row  and  column  coordinates.  With  each  row  of  X 
corresponding  to  a  cell  in  the  classification  table,  it  is  formed  as 
follows: 


X  = 


Main 

Effects 

Cell  No, 

y        w,        W^ 

S, 

S^ 

S, 

*^            1            2 

1 

2 

3 

1          1            0 

1 

0 

0" 

1 

1          1            0 

0 

1 

0 

2 

1          1           0 

0 

0 

1 

3 

1          Q           1 

1 

0 

0 

4 

1          0            1 

0 

1 

0 

5 

10           1 

0 

0 

1 . 

6 

Cell  5  is  affected  by  the  general  mean  and  the  main  effects  W-  and 
S„  so  entries  in  X  for  cell  5  are  1  for  these  effects,  all  others  being 
0. 
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It  is  noticed,  as  was  indicated  previously,  that  each  colmnn  of 
X  represents  an  effect  and  each  row  of  X  corresponds  to  a  market 
segment  or  classification  cell. 

What  is  meant  by  the  column  rank  of  X  in  terras  of  the  problem 
under  discussion  is  the  number  of  independent  effects  we  can  identify. 
If  we  have  a  sample  and  observe  a  portion  of  it  as  responding  to  the 
category  W  we  know  that  the  remainder  of  the  sample  respond  to  category 
W^.  Similarly,  if  we  know  the  number  of  responses  to  any  two  of  S  ,  S  , 
S_,  we  can  obtain  the  number  of  responses  to  the  third  category  of 
factor  S  by  simple  subtraction.   Since  we  can  only  estimate  1  W  category 
effect  relative  to  the  other  and  2  category  effects  of  factor  S  relative 
to  a  third,  we  say  that  the  column  rank  of  X  is  3.  The  practice  adopted 
in  this  study  is  to  consider  the  effects  relative  to  the  first  or  last 
category  of  each  classification. 

If  there  is  an  interaction  effect  between  one  W  category  and  another 
of  the  S  factor,  this  effect  is  reflected  in  the  row  of  X  corresponding 
to  the  cell  defined  by  the  categories  in  question.  AW,  S  interaction, 
therefore,  will  have  an  entry  of  1  corresponding  to  cell  1.   We  then  have 


W, 


W, 


1        1 

0 

1        1 

0 

1        1 

0 

1            0 

1 

1            0 

1 

1            0 

1 

Eff 

sets 

^1-^1 

h 

^2 

s 

Interact 

1 

0 

0 

l" 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0. 

1 

2 
3 
4 
5 
6 


y  denotes  a  general  mean  effect 
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The  column  rank  of  X  becomes  4  with  the  addition  of  the  interaction 
effect . 

Since  it  has  been  discussed  why  one  can  only  estimate,  say,  W 
relative  to  W-  and  S  and  S^  relative  to  S_,  in  the  modified  or 
reparameterized  problem,  the  1^1  interaction  corresponds  to  a 
(W.  -  W-)  by  (S  -  S,)  interaction. 

All  the  discussion  on  the  design  matrix  X  so  far  and  subsequent 
developments  regarding  it  can  be  easily  adapted  for  the  matrix  A  if 
the  roles  of  rows  and  columns  are  interchanged: 

(i)   We  speak  of  the  rows  of  A  as  representing  the  main  effects 
of  the  categories  of  classification. 
.  (ii)  Each  column  of  A  corresponds  to  a  cell  of  the  classification 
it  defines. 
(iii)  The  row  rank  of  A  is  the  sum  of  the  independent  effects  of 
the  categories  of  classification  that  can  be  defined  and 
the  number  of  estimable  interactions. 
A  hypothetical  example  is  considered  where  the  response  classifi- 
cation categories  are  defined  by  the  crossing  of  2  classes  of  Retail 
Floor  Area,  F^  and  3  categories  of  Airline  Distance,  D  where  we  have  the 
following  table: 


D, 

D^ 

D, 

1 

2 

3 

1 

1 

2 

3 

2 

4 

5 

6 

An  A  matrix  that  considers  an  F  ,  D^  interaction  takes  the  follow- 


ing form: 
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Cell  No, 

1 

2 

3 

4 

5 

y 

'l 

1 

1 

1 

1 

Main 

^1 

1 

1 

1 

0 

0 

Effects 

^2 

0 

0 

0 

1 

1 

^1 

1 

0 

0 

1 

0 

^2 

0 

1 

0 

0 

1 

■^3 

0 

0 

1 

0 

0 

Interact 

ion 

F,,D^ 

1 

0 

0 

0 

0 

=  A 


y  denotes  a  general  mean  effect 

In  a  reparameterized  problem  in  which  we  estimate  effects  relative 
to  the  last  category  we  shall  estimate  (F  -  F  ),  (D  -  D  )  and  (D2  -  D,) 
main  effects  and  an  (F^  -  F„)  by  (D^  -  D_)  interaction.  The  row  rank 
of  A  will  therefore  be  4.  We  notice  that  this  A  matrix  with  one  inter- 
action is  the  transpose  of  the  X  matrix  with  similar  effects  previously 
discussed. 

Discussions  in  connection  with  equations  (5.1)  to  (5.6)  led  to  the 
reparameterized  model  that  expresses  the  matrix  of  multivariate  logits, 
Z  as: 


Z  =  K   r   T      as  given  in  equation  5.7 
nxr  rxs  sxm 

The  multivariate  logits  Z.,  form  the  elements  of  Z.  The  probability 

that  a  member  of  market  segment  j  responds  to  category  h  in  the 

response  classification  is  given  by 

Z. 


jh 


P. 


jh    Z.-    Z.„ 
jl     i2 
e    +  e  -^  +. .  .+e 


j  =  l,...,n 
h  =  1, .  . .  ,m 

z. 

jm 
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n  =  number  of  cells  in  the  classification  defining  X 

r  =  column  rank  of  X  as  previously  defined 

m  =  number  of  cells  in  the  classification  defining  A 

s  =  row  rank  of  A  as  previously  defined 

K  =  column  of  basis  of  X 

T  =  row  basis  of  A 

r  =  the  rxs  matrix  whose  elements  are  linear  functions  of  the 
original  model  parameters 

The  linear  functions  defining  the  elements  of  T   in  terms  of  the 
original  parameters  in  g  (See  equation  5.2)  are  not  of  immediate  im- 
portance for  the  estimation  process  since  we  can  consider  the  problem  as 
that  of  estimating  r.  Once  this  has  been  accomplished,  one  can  relate 
the  elements  of  r  to  those  of  §.  A  simple  example  of  this  procedure 
will  be  given  later.   In  terms  of  the  previous  examples  in  defining 
X  and  A 

n  =  6     r  =  4      m  =  6     s  =  4 

Since    X  =  Transpose  (A)  we  also  have  the  relationship 
K  =  Transpose  (T)  for  the  examples  discussed. 

The  basis  construction  for  crossed  designs  will  now  be  discussed 
in  terms  of  T. 

h  categories  are  supposed  in  a  way  of  classification  in  a  crossed 

design.   In  the  crossed  design  corresponding  to  the  above  A  matrix, 

h  =  2  for  Retail  Floor  Area  and  h  =  3  for  Airline  Distance. 

A  contrast  matrix  C,  ,  ,  is  defined  the  columns  of  which  contrast 

-hxh-1 

the  main  effect  of  the  last  class  with  each  of  the  remaining  classes: 
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-hxh-1 


1-1/h 

-1/h   . 

..  -1/h 

-1/h 

1-1/h 

..   -1/h 

• 

-1/h 

-1/h   . 

. .  1-1/h 

-1/h 

-1/h   . 

..   -1/h 

The  interpretation  of  C  is  given  below  with  the  Airline  Distance 
classifications  D^ ,  D^,  D_.  With  h  =  3  and  the  contrasts  being  relative 
to  the  last  category,  we  have 


2/3 

-1/3 

-3x2 

-1/2 

2/3 

-1/3 

-1/3 

Defining  a  row  vector  R  =  [D,D^D_]  whose  elements  are  airline  distance 
classification  effects. 


RC  =  [D, 


(DJ+D2+D2) 


(D  +D  +D,) 
2  3    ■" 


The  first  element  of  RC  can  also  be  expressed  as: 


(Dj  -  D3)  -  3[(D^  -  D3)  .  (D2  -  D3)  .  (D3  -  D3)] 

which  is  a  contrast  of  the  effect  D.  relative  to  D_  and  the  average  of 
all  the  category  effects  relative  to  D-.   Similar  remarks  can  be  made 
about  the  second  element  of  RC. 

For  the  hypothetical  example  in  connection  with  the  A  matrix 
definition,  a  C  matrix  for  Retail  Floor  Area  Classification  is 


:2xl 


{.:;;] 
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A  detailed  discussion  of  basis  construction  has  been  given  by 
Bock  [105] .  The  simplified  discussion  in  this  section  is  adequate 
for  the  work  in  this  study. 

The  basis  T  for  a  crossed  design  is  obtained  from  the  Kronecker 
product  of  columns  of  matrices  of  the  form  [lt^,C]  v,-here 


^hxh  ~  '•-h'--'   ^^  defined. 


The  Kronecker  product  of  matrices  U    and  V    is  defined  as 

mxn      pxq 


^iV 

u,,Y    . 

. .     u,    V 
in- 

^nV 

U22Y    . 

.     u-  V 
2n- 

y  ®  V  = 

np  X  nq 

^ul'~ 

%2Y    • 

.     u     V 

mn- 

For  the  2x3  classification  with  Retail  Floor  Area  and  Airline 
Distance  presented  above^  the  basis  T  is  formed  as  the  Kronecker 

products  of  appropriate  columns  of  the  following  matrices 


9i  = 


0  1 

1  1/2 
1  -1/2 


and 


0 

1 

2 

1 

2/3 

-1/3 

1 

-1/3 

2/3 

1 

-1/3 

-1/3 

where  Q  and  Q_  correspond  to  -Retail  Floor  Area  and  Airline  Distance 
classifications  ,  respectively.   Ke  denote  i,j  as  the  Kronecker  product 
of  Column  i  Ci  =  0,1)  of  g  and  Column  j  (j  =  0,1,2)  of  52- 
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fxsmples  of  what  effects  such  Kronecker  products  correspond  to 
when  the  basis  T  is  formed  are: 


Kronecker  Product 
Indices 

1.0 

0,1 

0,2 

1.1 


Effect 


F^  -  F_  Main  effect 

Dj  -  D^  Main  effect 

D-  -  D_  Main  effect 

(Fj^  -  F^)  X  (D  -  D_)  Interaction 


The  4x6  basis  T  corresponding  to  the  above  effects,  of  row  rank=4, 
whereby  the  Kronecker  products  form  rows  of  T,  is: 


f 


i4x6 


1/2 

1/2 

1/2 

-1/2 

-1/2 

-1/2 

2/3 

-1/3 

-1/3 

2/3 

-1/3 

-1/3 

1/3 

2/3 

-1/3 

-1/3 

2/3 

-1/3 

1/3 

-1/6 

-1/6 

-1/3 

1/6 

l/6_ 

In  all  the  applications  of  the  multinomial  response  relation 
technique  in  this  research,  different  market  segments  are  analyzed 
separately  instead  of  constructing  a  basis  K  whose  column  rank  is  equal 
to  the  number  of  market  segments.  The  problems  considered  thereby  are 
those  of  contingency  tables  formed  by  classifications  defined  for  the 
response  categories.  Another  reason  for  the  separate  analysis  is  the 
chance  allowed  for  certain  multivariate  tests  to  be  performed  on  the 
parameter  vectors  estimated  and  their  associated  covariance  matrices. 

The  computer  program  written  for  estimating  multinomial  response 
relation  model  parameters  has  provision  for  constructing  bases  K  and  T 
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for  up  to  4  ways  of  classification  for  crossed  designs.  The  estimation 
procedure  provides  parameters  which  constitute  the  elements  of  T   in 
equation  5.7  through  an  iterative  procedure  discussed  between  equations 
5.13  and  5.15.  The  initial  elements  of  T   are  assumed  to  be  all 
zero  and  convergence  of  estimates  is  usually  achieved  within  six 
iterations.  A  detailed  discussion  of  the  parameter  estimation  pro- 
cedure is  given  in  Appendix  A  with  an  example  problem. 

Multinomial  Response  Relation  Model  and  Destination  Choice  Modelling 

For  destination  choice  modelling  for  shopping  travel,  for  example, 
the  X  matrix  in  equation  5.2  can  be  defined  from  a  cross  classification 
scheme  that  defines  market  segments  according  to  factors  that  affect 
the  choice  of  functional  shopping  destinations  as  discussed  in  the 
previous  section.  The  functional  classification  of  shopping  areas  can 
be  on  the  basis  of  a  measure  of  attraction,  parking  availability  and 
the  relative  location  of  the  shopping  area.   Figure  5.1  illustrates 
boxes  P  and  Q  respectively  that  illustrate  classifications  that  define 
matrices  X  and  A  in  equation  5.2. 

In  the  general  case  X  will  have  as  many  rows  as  there  are  non- 
empty cells  in  box  P  and  as  many  columns  of  the  main  effects  and 
interactions  that  account  for  the  classification  in  box  P.   For  each 
cell  of  P  that  defines  a  row  j  of  X  the  column  element  is  1  or  0 
depending  on  whether  it  impinges  on  the  cell  or  not.  Further  examples 
of  the  construction  of  design  matrices  like  X  can  be  found  in  texts  on 
linear  models,  [92,  96].  The  construction  of  matrix  A  can  be  discussed 
in  terms  of  box  Q  as  has  been  done  for  matrix  X.   One  can  consider  cell 
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j  of  box  P  as  market  segment  j  and  cell  h  of  box  Q  as  the  functional 
shopping  destination  h.  M.  in  equation  5.1  is  the  number  of  indivi- 
duals or  households  that  belong  to  market  segment  j.  The  term  r., 
is  the  number  of  the  members  of  market  segment  j  that  choose  function- 
al shopping  destination  h.  The  term  P.,  is  the  probability  with  which 
a  member  of  market  segment  j  chooses  a  functional  destination  h.  Once 

matrices  X  and  A  and  r.,  and  N.  have  been  defined,  the  rest  of  the 

jh      J 

development  in  the  previous  section  is  independent  of  the  context  in 

which  the  multinomial  response  model  is  being  applied. 

The  major  requirements  for  applying  the  approach  discussed  in  this 

section  are  the  specification  of  the  geographical  distribution  of 

characteristics  on  which  the  travel  market  segmentation  is  based,  the 

geographical  distribution  of  destination  opportunities  and  factors  with 

which  they  can  be  classified  and  a  suitably  designed  sampling  scheme 

for  data  collection.   It  is  then  possible  to  estimate  r.,  ,  the  number 

^  jh' 

of  people  in  market  segment  j  that  select  functional  travel  category  h. 
The  number  of  people  in  the  whole  population  belonging  to  market  segment 
j  who  choose  travel  category  h  is  estimated  by  ft.,  =  f.r.,  where  f.  is 
an  appropriate  sampling  ratio  for  market  segment  j .  A  simulation 
technique  can  then  be  employed  to  assign  say,  specific  destinations. 
One  criterion  is  to  choose  the  closest  physical  destination  that  falls 
within  a  chosen  functional  destination  from  the  point  of  view  of  the 
location  of  a  households  or  cluster  of  households.  The  frequency 
distribution  of  individual  characteristics  that  will  be  used  in  the 
simulation  will  be  obtained  from  the  observed  distribution  of  those 
characteristics  for  the  N.  individuals  in  the  sample  belonging  to  market 
segment  j . 
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To  design  a  sampling  scheme  census  records  can  be  used  to  deter- 
mine the  geographical  distribution  of  socio-economic  and  population 
attributes  that  contribute  to  the  market  segmentation  process.  The 
city  block  geographical  coordinate  system  being  developed  for  some 
metropolitan  areas  will  provide  the  needed  geographical  framework  [97]. 
A  similar  treatment  will  be  given  to  the  geographical  distribution  of 
shopping  areas.  A  random  mail  or  telephone  survey  or  a  small-sample 
home  interview  can  be  taken,  say,  of  auto  travellers  after  randomly 
selecting  cells  [representing  subzones  or  a  finer  division]  in  the  scheme 
in  Fig.  5.1.  For  shopping  travel,  a  suitable  diary  [61]  can  be  solicited 
or  a  straightforward  interview  can  be  conducted.  The  important  require- 
ment is  that  one  be  able  to  collect  sufficient  information  to  adequately 
cover  all  market  segments.   It  is  possible  to  make  the  sampling 
efficient  with  a  prior  identification  of  market  segments  from  census 
data.  The  trade-off  between  taking  a  reasonably  large  sample  that  covers 
a  majority  of  the  market  segments  and  using  an  excessive  sample  to  cover 
all  the  market  segments  can  be  better  appreciated  this  way.  Under  the 
assumption  that  the  pattern  of  travel  of  market  segments  does  not 
change  drastically  with  respect  to  a  given  trip  purpose,  such  a  survey 
can  be  conducted  at  15  to  20  year  intervals. 

The  above  has  discussed  how  aggregate  predictions  of  functional 
travel  behaviour  can  be  accomplished,  supplemented,  with  simulation  to 
assign  people  to  specific  shopping  areas  that  are  impinged  upon  by 
primary  urban  travel.  In  situations  like  subarea  studies,  the  opening 
up  of  new  activity  centers  or  new  residential  areas,  etc.,  a  trans- 
portation planner  would  like  to  predict  the  travel  patterns  of 


86 


individuals  in  an  isolated  context.  A  logistic  discrimination  approach, 
later  discussed  in  this  chapter,  can  be  used  to  classify  individuals 
into  classes  of  functional  travel  behaviour.  The  random  sample 
discussed  earlier  in  this  section  can  be  used  to  develop  such  a 
discrimination  model  for  each  market  segment  or  for  the  whole  popu- 
lation in  which  case  the  market  segmentation  factors  will  contribute 
to  the  discrimination  function.  To  accomodate  the  possibility  of  the 
failure  of  such  a  discrimination  model  to  adequately  represent  relative- 
ly uncommon  categories  of  travel  behavior,  a  separate  sampling  logistic 
discrimination  procedure  will  be  outlined.   In  small  urban  areas  where 
the  underrepresentation  of  some  travel  behavioural  categories  is  less 
likely,  the  discrimination  model  can  be  developed  directly  to  estimate 
r.,  frequencies  that  can  be  expanded  to  obtain  aggregate  predictions 


R.,  . 


The  multinomial  response  model  provides  estimates  of  the  proportion 


of  market  segment  j  that  falls  into  functional  travel  category  h:-  II..  ; 
or  the  proportion  of  the  total  population  that  falls  into  functional 
travel  category  h:-   n,  .   It  also  allows  the  testing  of  interaction 
effects  that  may  affect  the  development  of  the  discrimination  model  if 
only  main  effects  are  considered.  The  logistic  discrimination  approach 
can  be  particularly  useful  in  a  small  urban  area  where  the  simulation 
previously  discussed  could  deal  with  predicting  the  probability  of 
individuals  being  associated  with  functional  travel  alternatives. 

It  must  be  noted  that  in  the  context  of  clearly  identified  non- 
spatial  choices  the  problem  discussed  in  this  section  is  considerably 
simplified  since  the  alternatives  are  specific  and  aggregate  predictions 
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can  be  made  more  directly.  The  incorporation  of  attitudinal  orientation 
in  the  market  segmentation  can  be  operational  if  it  is  possible  to 
relate  attitudes  to  some  socio-economic  or  population  characteristics. 
Not  much  success  has  been  realized  in  this  respect.  The  utility  of 
attitudinal  data,  meanwhile,  will  be  in  short  term  studies. 

It  is  possible  to  extend  the  travel  functional  classification  in 
Fig.  5.1  for  a  given  purpose  to  travel  frequency,  time  of  day  and  route 
functional  type  as  further  ways  of  classification.   Level  of  service 
attributes  developed  from  prevailing  traffic  conditions  will  go  into  the 
definition  of  route  functional  types.  Though  such  an  extensive 
functional  classification  is  theoretically  feasible  the  number  of 
travel  behavior  categories  can  easily  get  unwieldy.   It  is  then 
necessary  to  conduct  tests  of  independence  in  the  resulting  contingency 
table  so  that  the  block  (See  Fig.  5.1)  of  classification  can  be 
partitioned  into  manageable  subregions  for  separate  conditional  analyses. 
The  associated  extensive  data  requirements  for  such  an  exercise  should 
not  be  a  deterrent  since  it  would  in  the  long  run  throw  more  light  into 
the  analysis  of  travel  behavior. 

The  objection  may  be  raised  against  the  use  of  categorical 
variables  in  this  section  and  the  implied  reduction  of  "continuous" 
variables  like  airline  distance  and  travel  time.  The  present  level  of 
data  availability  in  many  aspects  of  travel  behaviour  and  scale  of 
measurement  is  such  that  one  cannot  help  conceding  that  most  of  the 
variables  on  hand  are  qualitative.  There  is  evidence  in  perceived 
data  on  travel  distance  and  travel  time  that  these  variables  are  usually 
reported  in  pulses  in  a  frequency  distribution  chart  [see  Fig.  5.2,  for 
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example] .  The  logistic  discrimination  approach  is  capable  of  mixing 
continuous  and  categorical  variables,  however,  using  a  similar 
transformation  of  continuous  distributions  into  discrete  ones. 

The  discussion  in  this  section  can  be  summarized  in  the  form  of 
Fig.  5.3  where  the  use  of  separate  analytical  tools  on  attitudes,  level 
of  service  on  routes,  functional  destination  classification  and  market 
segmentation  can  be  brought  into  a  single  perspective  for  the  purpose 
of  predicting  aggregate  travel  behavioral  demand.  There  is  provision 
for  iteration  within  a  simulation  context.  The  advantage  in  such  usage 
of  the  multinomial  response  approach  is  that  it  does  not  attempt  to 
develop  a  single  model  with  all  the  detailed  factors  and  variables  that 
go  into  the  functional  classification  of  spatial  choices,  market 
segmentations  and  attitudinal  categorization. 

Logistic  Discrimination 

Many  problems  often  crop  up  where  there  is  interest  in  discriminating 
between  two  or  more  populations  using  discrete,  categorical  and/or  con- 
tinuous variables.   In  urban  travel  behavior  analysis,  we  may  want  to 
discriminate  between  auto  users  and  transit  users  or  discriminate  among 
different  populations  that  are  associated  with  different  functional 
travel  alternatives.  The  greater  amount  of  previous  statistical  work  in 
discrimination  has  been  with  multivariate  normal  distributions  or  with 
techniques  based  on  Fisher's  linear  discriminant  function  [89]  which 
are  optimal  only  for  normal  distributions. 

Some  earlier  attempts  at  discrimination  with  qualitative  variables 
[98,  99,  100]  have  been  restricted  in  the  number  of  variables  permitted. 
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A  recent  method  by  Martin  and  Bradley  [101]  is  only  restricted  to  dis- 
crete variables.  A  very  useful  approach  is  that  of  logistic  discrimi- 
nation introduced  by  Cox  [102]  and  Day  and  Kerridge  [103]  and  further 

developed  by  Anderson  [90] .  Quoting  Anderson  [91] ,  "With  this 
technique,  continuous  and  discrete  variables,  jointly  or  separately, 
can  be  handled  with  equal  facility.  There  is  no  restriction  on  the 
number  of  variables  that  can  be  used  other  than  those  imposed  by 
sample  size  and  computer  time.   To  date,  20  variables  have  been  used 
and  there  is  no  doubt  that  larger  problems  could  be  handled--they 
have  just  not  arisen." 

The  problem  of  interest  is  to  discriminate  among  k  populations  H^ , 

H^,...,H,  based  on  the  vector  of  observations  x'  =  (x„,  x,,...,x  ) 
2      k  -    ^  0   1      p 

where  the  prime  indicates  a  transpose.  x_  =  1  to  account  for  a  constant 

term.   The  components  of  x  are  real  discrete  or  continuous  variables. 

Sample  points  or  observations  are  available  from  each  population 

s  (s  =  l,2,...,k)  and  the  discrimination  problem  is  to  develop  a 

means  of  allocating  further  sample  points  x  to  populations.   If  the 

likelihood  of  x,  given  H  is  f  (x)  and  the  further  sample  points  to  be 

allocated  are  from  a  mixture  of  distributions  of  H, ,  H_,...,H,  in  the 

k      12k 

proportions  11'  =  (11^,  11^,..., II,),  where   ^  II  =  1,  a  simple  way  of 

s=l 
optimizing  the  method  of  discrimination  is  to  maximize  the  probability 

of  correct  allocation  or  classification. 

The  sample  point  x  is  therefore  allocated  to  H  if  n  f  (x)  >_ 

n^f^(x)  (t  =  l,...,k;  t  ^  s) 

or 

P^(hJx)  ^  P^CH^Ix)   (t  =  l,...,k;  t  /  s)       5.17 
where  P  (a)  =  probability  of  'a'. 
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A  discussion  of  the  above  allocation  criteria  can  be  found  in  Rao 
[89] .   It  is  noted  here  that  the  multinomial  logit  model  for  travel 
choices  with  a  common  set  of  alternatives  has  the  same  allocation 
rule.   In  this  case  the  pooling  of  variance-covariance  estimates  dis- 
cussed  in  Chapter  3  comes  from  the  different  common  alternatives. 

Cox  [102]  and  Day  and  Kerridge  [103]  proposed  the  logistic  form 
for  posterior  probabilities  as  a  basis  for  discrimination  between  two 
populations: 

P^(Hj|x)  =  e^'?  .  P^(H2lx)  5.18 

where  a'  =  (a„,  a.,..., a  )  is  the  transpose  of  a  column  of  coefficients 
and 

PrCH^Ix)  =  l/(l+e~'~).  5.19 

a  is  estimated  directly  compared  with  the  standard  discrimination 
approach  with  multivariate  normal  distributions  which  essentially 
estimates  a  as  a  function  of  the  distribution  means  and  covariances. 

An  extension  of  equations  5.18  and  5.19  to  k  populations  [90] 
is  given  by 

Pr(H  |x)  =  e~'~s  .  Pr(H,  |x)   (s  =  l,...,k-l)   5.20 


k-1  xa 


s^ 


PrCR  Ix)  =  1/{1+  I     e   ^}  5.21 

s=l 

where  a'=(a,a,...,a). 
-s     s'     r  '     s 
u     1        p 

This  represents  an  extension  of  an  asymmetrical  binary  logistic 
distribution,  compared  to  Bock's  [86]  extension  of  a  symmetrical  form  as 
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a  result  of  which  the  estimation  of  a  mean  terra  in  the  symmetrical 
case  is  sacrificed.   Bock's  multinomial  response  model  and  the  multi- 
nomial logit  model  used  in  many  urban  travel  behavior  analyses  involve 
the  extension  of  the  symmetrical  binary  logistic  model.   Only  the 
parameters,  a  required  for  discrimination  are  estimated,  the  method 
being  the  same  for  all  f  (x)  satisfying  equations  (5.20)  and  (5.21). 
Classical  approaches,  however,  derive  the  allocation  rule  from  estimates 
of  all  the  parameters  postulated  in  each  distribution  with  the  result 
that  many  more  parameters  have  to  be  estimated  and  different  methods 
are  necessary  for  different  assumed  families  of  distributions  {f  ix)}. 
The  important  advantages  of  the  Cox-Day-Kerridge  approach  are  realized 
in  the  fact  that  equations  [5.20  and  (5.21)  are  satisfied  by  many  of  the 
families  of  distributions  commonly  postulated  in  discrimination. 
Examples  are: 

(i)   multivariate  normal  with  equal  dispersion  matrices 
(ii)  multivariate  independent  dichotomous  (0,1)  variables 
(iii)  multivariate  dichotomous  variates  following  the  log  linear 
model  [104]  with  second  and  higher  order  effects  the  same 
in  each  population, 
(iv)   a  combination  of  (i)  and  (iii)  . 

Equations  (5.20)  and  (5.21)  hold  exactly  or  approximately  in 
many  other  cases  and  will  hold  over  a  wider  class  of  distributions  if 
second  order  terms  are  included.  The  inclusion  of  second  order  terms 
compounds  the  number  of  parameters  to  be  estimated,  however. 


94 


Estimation  when  Sampling  From  a  Mixture  of  Distributions 

The  Cox-Day-Keridge  formulation  assumed  that  sample  points  x  were 

available  from  the  mixture  of  two  populations  in  unknown  proportions. 

Generalizing  this  to  k  populations  it  is  supposed  that  n   sample 

k  ~ 

points  are  observed  from  H  at  the  point  x.   Defining  J^  n   =  n  ,n  of 

s=l   ~    -   ~ 

a  sample  size  n  are  observed  at  x.  Usually  most  of  the  n   will  be  zero 
^  ~        -^  sx 

and  the  rest  unity.   If  /  n   =  n  ,  then  the  (n  }  are  random  variables. 
•'  ^  sx    s  s 

X 

Defining  <})  as  the  likelihood  of  the  mixture  distribution  at  x, 

k 

<i)     =     I     Jl  f   (x)  where  the  {n  }  are  the  unknown  mixing 
s=l 
proportions. 

The  log-likelihood  of  the  sample  can  be  written  as 

log  L  =  Constant  +  I  I  n^^  log(Ps^  4'^) 

X  s   ~       -   - 

where  P   is  the  probability  of  sample  point  x  being  allocated  to 

population  s. 

It  can  be  shown  [90]  that  the  maximum  likelihood  equations  for 

{ct  }  are 
s 

I  1°^  ^     =  I  (n^^  -  n  P  )x  =0   (s  =  l....,k-n     5.22 
da.      '^       sx    X  sx  J 
SJ      X    ~     ~    ~   -^ 

(j  =  0,1,. ..,p) 


The  solution  of  (5.22)  is  discussed  later. 

Estimation  Under  Separate  Sampling  of  Populations. 

In  many  situations  samples  are  taken  from  each  population  separately, 

in  which  case  the  {n  }  are  fixed.  The  mixture  of  distributions  from 

s 

which  the  sample  points  to  be  allocated  are  drawn  must  be  defined.  The 
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mixing  proportions,  n'  =  Cn,  ,Il2 » •  •  •  >ni^)  corresponding  to  populations 

k 
H^ ,  H»,...,H,  are  specified  where  J  n  =1.   Where  possible,  these 

s=l 
mixing  proportions  are  given  or  estimated  from  other  data.  The  esti- 
mation of  these  proportions  from  a  multinomial  response  relation 
analysis  has  been  previously  discussed.   Where  the  mixing  proportions 
cannot  be  estimated,  likelihood  ratios  can  be  used  as  later  discussed. 
The  joint  likelihood  of  the  samples  from  populations  H, ,  H2,...,H, 
is  given  by 

k 
log  L  =  constant  +77"   {log  L(x|H  )}  5.23 

s=l  x 

Temporarily  supposing  x.  is  dichotomous  with  values  0  or  1  (j=l,...,p) 

then 


L(x|H^)  =  pr(xlH^)  =  Pr(Hjx)  Pr(x)/Pr(H^) 


Letting  <},^  =  Pr(x) 

L(x|H  )  =  P   d,,/n  5.24 

^- 1  s     sx  ^x  s 

k 
leading  to  log  L  =  constant  +  Y  Y  n   log(P   (j,  )  5.25 

s=l  X   ~       ~   - 

where  the  constant  includes  terms  independent  of  the  {a  }  ^^'^   ^^e 

{*^}. 

The  likelihood  in  equation  (5.24)  is  identical  to  the  one  given  by 
the  Cox-Day-Keridge  formulation.   However,  the  unknown  parameters  here 
are  related  to  the  following  functionally  independent  conditions. 

I  *x  =  ^'  I   ^sx  K   =  "s   ^'  =  l>---.k-l)        5.26 
The  estimation  problems  in  the  two  cases  are  therefore  not  the  same. 
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With  the  method  of  constrained  maximum  likelihood  estimation  of 
Aitchison  and  Silvey  [73]  the  parameters  in  equation  (5.25)  can  be 
estimated  subject  to  the  constraints  in  equation  (5.26).  The  algebra 
in  this  procedure  is  very  heavy  while  the  asymptotic  dispersion  matrix 
is  unwieldy.  A  less  direct  approach  used  by  Anderson  [90]  greatly 
simplifies  the  problem. 

We  note  that  the  proportions  n  can  be  fixed  at  arbitrary  levels 
subject  to  the  usual  conditions  on  probabilities.  Considering  a  set  of 
proportions,  n,  to  which  all  the  preceeding  equations  in  this  section 


apply,  and  a  second  set  of  proportions,  H',  for  which 

X  a'  I 

Pr'(H^lx)  =  P^^  =  e~  ~  •  Pr'(Hj^lx)   (s  =  l,...,k-l)   5.27 


T 
where  x  =  x  -transpose 


k-1  X  a 


Pr'CH^Ix)  =  p^  =  1/{1  *     I     e"    '   ^}  5.28 

s=l 


and 


Pr'(x)  =  <(:•  for  all  X 
Since  the  samples  from  L(x|H  )  do  not  depend  on  the  choice  of  n 

Lf^lV  ^  ^sx  V^s  =  P;x  *'x/n;   Cs  =  l,...,k)        5.29 

Dividing  the  sth  equation  of  5.29  by  the  kth 

(e   +x  a^) 

P'   =  e        P;(x)  5.30 

sx  k  ~ 

k-1   (B  +x  aj 
^kx  =  ^/^^^  I     ^     s  -  ~s  J  5_3j 

s=l 
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where  3  =  log  {n  n*  /n*  n  } 

5  K   5     K  5 


It  is  noticed  that 


a'   =  a   +  P 
so    so    s 

a..=a.  5.32 

S3    sj 


(s  =  l,...,k-l)   (j  =  l,...,p) 

The  above  implies  that  only  the  {a  }  are  changed  when  H  is  varied, 
all  other  coefficients  {a  .,  jj^O}  being  the  same.  The  maximum 
likelihood  estimates  of  the  {a's)  for  one  IT  can  be  derived,  therefore, 
from  those  of  another,  say,  IT'.  The  choice  of  II  =  n*  =  n  /n  consid- 
erably simplifies  the  constrained  maximization  problem  [90] .   For  this 
reason  the  most  appropriate  procedure  is  to  carry  out  the  estimation 

with  n*  and  then  adjust  the  {a  }  for  the  defined  5. 
~  -"         so 

Anderson  [90]  has  shown  that  the  estimates  of  the  {a   }  with  n  =  n* 
given  by  maximizing  the  likelihood  in  equation  (5.25)  subject  to  the 
constraints  in  equation  (5.26)  is   given  by: 

I    (Hg^  -  n/sx^^i  "  °   (s=l,...,k-l;  j=0,l,...,p)   5.33 
X    -    -  -  ■' 

This  result  is  obtained  after  a  considerable  amount  of  algebraic 
reduction  using  Lagrange  multipliers. 

It  is  noted  that  equations  (5.22)  and  (5.33)  are  identical.  The 
same  procedure  can  therefore  be  used  to  estimate  the  {a  }  in  both 
situations--mixture  and  separate  sampling  discrimination.  Anderson 
[90]  has  used  Aitchison  and  Silvey's  [73]  result  on  asymptotic  dispersion 
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matrices  with  constrained  maximum  likelihood  to  show  that  the 

asymptotic  dispersion  matrices  of  the  estimates  of  {a  }  for  separate 

and  mixture  sampling  differ  only  in  the  variance  and  covariances 

associated  with  the  (a  }. 

so 

Though  the  above  discussion  has  considered  only  dichotomous  vari- 
ables, Anderson  [90]  has  shown  that  the  methods  can  be  extended  to 
continuous  and  polychotomous  variables.  This  is  outlined  below: 

If  the  observations  are  continuous  the  log  likelihood  in  equation 
(5.25)  is  unchanged  but  the  constraints  in  equation  (5.26)  become: 

J*^d^  =  1,  J  P^^  (J.^  dx  =  n^     (s  =  l,...,k-l)       5.34 

The  maximization  of  log  L  in  equation  (5.25)  subject  to  constraints 
in  equation  (5.34)  requires  assumptions  about  the  functional  form  of 
<t>    .     This  can  entail  extra  parameters  8  for  4>     and  the  resulting 
maximum  likelihood  equations  will  not  reduce  to  two  sets  for 
A  =  [a^,...,ci,  ^]  and  6  separately.   Such  a  problem  is  similar  to  the 
classical  approaches  with  all  their  shortcomings.  However,  it  is  possi- 
ble to  get  around  such  a  problem  by  making  the  continuous  distribution 
discrete  and  estimate  <))  A  in  which  case  the  solution  of  equations 
(5.33)  can  be  used. 

If  we  put  4-  =  (t)  A   (for  all  x)  5.35 

then  the  conditions  in  equation  (5.34)  become,  approximately 

yij)A=i,  y(f)PA=n 

^  ^X   X  ^  ^X   SX   X     s 

X   ~   ~  X   ~    ~   ~ 

or  ""  ~  5.36 
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yi'=i  y¥P=n  5.36 

''       X  ^       X        SX  X 

X       ""  X       ^        ""  ~ 

.(s=l,...,k-l) 


The   likelihood  then  becomes 


k-1  P       ((i  n 

„    n     n  c-^p^  A^)     ~ 

S=l    X  s  - 


k-1  P        4*     n 

=     n     n   (  ^-     -)   ^~  5.37 

S=l    X  s 


The  maximization  of  the  expression  in  equation  (5.37)  subject  to 
the  constraints  in  equation  (5.36)  is  then  similar  to  that  discussed 
leading  to  equations  (5.23)  to  (5.33).   It  is  demonstrated  in  Anderson 
[90]  with  some  simulation  that  the  above  approach  gives  good  estimates 
of  the  discriminant  function  when  k-2  and  the  underlying  distributions 
are  multivariate  normal.  .; 

When  the  mixing  proportions,  IT,  are  unknown  or  cannot  be  estimated, 
the  allocation  rule  of  equation  (5.16)  cannot  be  used.  The  obvious 
criterion  to  use  in  this  case  is  the  likelihood  ratio  for  which  esti- 
mation equations  (5.22)  and  (5.33)  can  still  be  used. 

Defining  the  likelihood  ratio  for  H  and  fl  as  R  ^,  and  assuming 
^  s       t      St 

hypothetical  proportions  IT 

L(x|H  )     P   n, 
- '  s       sx  t 


St   L(xH,)     P^  n 
~ '  c       tx  s 


from  which 


where 
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T 
log  R   =  X  (g  -  g  )  -  r      (s,t,  =  l,...,k-l) 

^^    „  ^    ^     ^^  5.38 

log  R^^  =  x'iq^   -   T^^)         (s  =  l,...,k-l) 

r^^  =  log  (n^/n^),     (s,t=l,...,k) 
If  the  {a  }  are  estimated  with  II*  defining  a  hypothetical  mixture, 


estimates  of  the  likelihood  ratios  can  be  obtained  from  equation  (5.38). 

The  Newton-Raphson  iterative  procedure  has  been  used  by  Cox, 
Day  and  Kerridge  and  Anderson  [90,  103]  for  estimating  {a   }  in  equations 
(5.22)  and  (5.33).  This  procedure  has  already  been  discussed  in  con-  • 
nection  with  the  multinomial  response  model  earlier  in  this  chapter. 

The  likelihood  equations  (5.22)  and  (5.33)  can  be  written  as 


9  log  L  ^  ^ 

■'        X 

(s  =  1,. ..,k-l;  j  =  0,.,.. .,p) 


^  .  =  y  (n   -  n  P  )x.  =  0 
3a.     sj   '^  ^  sx    x  sx-^  1 


9  f 


y^  -  ^  "x  ^sx  ^x  -j    -1    f^^^) 


5.39 
8  f 


- — ^  =  -Y  n  P   (1-P  )  X.  x, 

3a,     ^  X  sx     sx^   J   1 

si    X  ~   ~      ~   -- 


The  (k-1)  (p+1)  square  matrix  F  with  elements  F  .  ^  =  3f  ./Sa^-,  is 
equivalent  to  the  matrix  4*  defined  in  connection  with  the  multinomial 
response  model  in  equation  (5.11). 

Choosing  Between  Mixture  and  Separate  Sampling 

Though  there  is  little  difference  in  the  asymptotic  properties  of 
the  dispersion  matrices  for  the  maximum  likelihood  estimates  for 
mixture  and  separate  sampling  logistic  discrimination,  Anderson  [90] 
recommends  separate  sampling  provided  the  n  can  be  preselected  to  be 
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approximately  equal,  n  being  the  fixed  sample  size  for  population  s 
in  separate  sampling.  This  provides  a  balance  among  the  n  .   In 
mixture  sampling  the  n  are  random  variables  and  there  is  a  strong 
possibility  that  travel  behavioral  responses  of  low  incidence  (e.g. 
transit  users,  senior  citizens  and  other  captives,  in  auto-oriented 
cities)  can  be  missed.   In  spite  of  the  intractability  of  the  small- 
sample  properties  of  maximum  likelihood  estimation  Anderson  expects 
that  for  a  given  total  sample  size  n,  samples  with  balanced  n  give 
better  estimates  on  trie  average  than  those  with  imbalance,  particularly 
where  some  of  the  populations  have  a  relatively  higher  incidence  than 
the  others.  The  advantage  of  separate  sampling  over  mixed  sampling 
will  definitely  be  better  realized  in  large-sampling  estimation  with 
some  relaxation  in  balancing  n  .   For  stable  parameter  estimates  that 
can  be  used  for  forecasting  and  computation  of  quantities  like  the 
value  of  travel  time  and  elasticities,  large  samples  made  efficient 
through  an  appropriate  partitioning  of  the  sample  size  among  the  n 
in  order  not  to  over-represent  or  under-represent  any  travel  population, 
is  recommended  for  transportation  studies.   Examples  of  the  separate 
populations  include  auto  users  and  transit  users  in  mode  choice 
analysis,  auto  travelers  identified  with  a  given  functional  category 
of  shopping  travel  destination  in  shopping  trip  destination  choice 
analysis,  people  associated  with  a  given  trip  frequency  for  a  given 
purpose  in  "trip  generation",  and  people  engaged  in  some  non-work 
trip  at  a  given  time  of  day,  in  analyzing  the  effects  of  time  of  day. 
Car  license  plates,  credit  card  addresses,  and  direct  interviews  of 
transit  riders  on  board  are  all  methods  that  can  be  investigated  for 
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planning  such  separate  sampling.  With  the  high  k  value  that  is  bound 
to  prevail  in  most  transportation  studies  one  cannot  expect  to  leave 
to  chance  the  likelihood  of  covering  all  the  k  travel  "behavioral 
populations"  in  mixture  sampling.  Suppose  there  are  three  functional 
travel  destinations  D  ,  D-,  D,  with  relative  patronage  proportions  10%, 
15%,  and  75%, a  sample  size  of  500  would  be  needed  to  cover  50  D  , 
75  D^  and  375  D_  in  random  mixture  sampling.   In  separate  sampling  a 
sample  size  of  only  225  would  be  required  to  cover  75  D^ ,  75  D~  and 
75  D_  with  an  estimate  of  the  populations  obtained  from  a  different 
data  source. 

Methodology  for  Data  for  This  Research 

With  the  Multinomial  Response  Relation  model  [MRR]  as  illustrated 
in  Figure  5.1.    each  cell  of  box  P  is  equivalent  to  a  submarket 
segment  which  is  defined  at  a  finer  level  than  a  sampling  scheme  can 
achieve.  A  sampling  framework  can  stratify  a  population  according  to 
variables  like  income  and  age  but  further  variables  and  factors  like 
attitude,  "accessibility",  the  number  of  pre-school  children,  and  an 
intervening  opportunity  field  relative  to  a  household  location  are  more 
difficult  to  specify  prior  to  data  collection.   It  is  supposed  that  the 
ways  of  classification  that  contribute  to  the  matrix  X  in  the  discussion 
of  the  MRR  model  are  A,  B,  C,  D.   What  logistic  discrimination  essen- 
tially does  is  to  use  for  example,  A  (or  A  and  B)  for  market  segmen- 
tation that  can  be  developed  prior  to  major  data  collection  (e.g. 
segmentation  according  to  income  or  structure  type) .  The  logistic 
discrimination  approach  then  develops  a  model  for  each  market  segment 


103 


provided  observations  are  available  for  such  a  purpose.   For  each  market 
segment  B,  C,  D(  or  C  and  D)  are  used  as  the  basis  of  discrimination 
among  k  populations.  The  MRR  approach  estimates  a  probability  distri- 
bution function  for  choosing  among  the  cells  of  box  Q  in  Figure  5.1  for 
each  cell  j  in  box  P  in  Figure  5.1.  The  MRR  procedure  thus  uses  the 
factors  characterizing  the  attraction  measures  of  destination  and  their 
relative  location  to  develop  probability  distribution  functions  for  the 
different  market  segments.   In  the  case  of  the  logistic  discrimination 
method,  however,  the  characteristics  of  the  destinations  that  define 
the  cells  of  box  Q  of  Figure  5.1  are  not  used  any  further  since  the  cells 
are  thought  of  as  mutually  exclusive  populations.   If  the  number  of  ways 
of  classification  that  goes  into  Q  is  high  there  can  be  a  large  number 
of  populations  for  which  extensive  data  collection  will  be  needed.   In 
addition,  there  can  be  several  cases  of  low  frequencies  within  some 
cells.   The  separate  sampling  logistic  discrimination  approach  alleviates 
the  difficulty  with  low  cell  frequencies  provided  the  relative  population 
proportions  can  be  estimated  from  more  cheaply  available  data.  Though 
the  likelihood  ratio  can  be  used  as  the  criterion  for  classification  in 
the  event  the  proportions  cannot  be  estimated,  the  procedure  can  be 
computationally  tedious  in  operational  transportation  planning. 

There  is  evidence  that  spatial  choices  of  destinations  are  more 
influenced  by  the  characteristics  of  the  alternatives  than  by  individual 
socio-economic  or  the  transportation  network  configuration  relative  to 
individuals  [10] .   What  this  means  is  that  after  a  functional  classi- 
fication has  been  achieved  the  only  essential  factors  that  remain  may  be 
on  attitudes,  relative  accessibility  and  intervening  opprotunity  fields. 
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With  the  nature  of  available  data  on  destination  choice  in  general, 
the  way  in  which  Kannel's  data  was  collected,  and  the  limited  sample 
size  for  the  latter,  empirical  testing  of  the  separate  sampling  logis- 
tic discrimination  procedure  could  not  be  accomplished  in  this  research. 
This  does  not  mean  that  logistic  discrimination  is  not  a  feasible 
approach.   If  one  can  define  sufficient  variables  that  influence  the 
choice  of  functional  shopping  classifications,  models  for  discriminating 
among  functional  destination  categories  can  be  achieved.   One  may  at 
this  stage  question  the  need  for  using  functional  destination  alter- 
natives but  this  takes  one  back  to  the  problem  of  choice  set  definition 
in  which  we  have  concluded  that  any  approach  using  physical  alternatives 
for  destinations  on  a  large  scale  is  impractical.   Where  there  is  a 
common  physical  choice  set  as  in  mode  choice  or  a  conceptual  choice  set 
as  in  time  of  day  choice,  one  does  not  only  have  few  populations  to 
discriminate  among,  but  there  is  not  the  possibility  of  exhausting 
traditional  variables  in  the  definition  of  a  functional  choice  set. 

In  the  opinion  of  the  writer,  the  logistic  discrimination  method, 
particularly  when  efficiently  used  with  separate  sampling,  could  be  an 
excellent  tool  in  such  situations. 

To  summarize  the  above  discussion,  the  MRR  approach  using  a  random 
sample  was  the  more  reasonable  procedure  to  use  to  estimate  a  destination 
choice  model  using  the  data  available  for  this  research.  The  cells  in 
box  P  of  Figure  5.1  can  be  appropriately  aggregated  prior  to  a  suitable 
factoring  for  aggregate  prediction.  The  contribution  of  the  discussion 
on  logistic  discrimination  relates  to  how  it  can  be  tied  to  efficient 
sampling  for  an  operationally  economical  data  collection  shceme,  an 
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area  that  further  research  with  suitably  collected  data  can  be  addressed 
to.  The  conclusion  therefore,  is  that,  with  the  available  data  on 
destination  choices,  one  is  compelled  to  use  the  MRR  approach  which  may 
be  inefficient  in  its  sampling  scheme.   With  the  availability  of  further 
variables  that  are  identified  through  attitudinal  work  or  the  definition 
of  better  level  of  service  variables  and  socio-economic  factors,  one  can 
consider  using  logistic  discrimination  with  more  efficient  sampling. 
Only  the  MRR  approach  is  estimated  and  discussed  in  succeeding  chapters. 
The  provision  for  aggregation  with  the  MRR  method  is  only  a  natural 
consequence  since  the  indicators  and  foundations  for  aggregation  are 
derived  from  the  level  to  which  the  cells  of  box  P  in  Figure  5.1  can 
be  put  together  to  a  stage  where  a  data  sampling  scheme  based  on 
secondary  data  can  define  an  appropriate  expansion  factor  for  aggregate 
predictions. 

Large  Sample  Maximum  Likelihood  Estimators  and  Their  Statistical 
Properties. 

The  procedures  of  multinomial  logit  analysis  with  a  common  set  of 
alternatives  [See  Chapter  3] ,  and  the  multinomial  response  relation  and 
logistic  discrimination  analyses  can  also  be  viewed  essentially  as 
specifying  cell  (physical  or  functional  alternative)  probabilities  as 
functions  IT  (6)  , .  .  .  ,n,  (§)  involving  q  unknown  parameters  (0,,...,e  )  =  6' 
Such  a  specification  is  for  a  given  market  segment.  The  IT.  as  discussed 
here  are  cumulative  distribution  functions  of  the  logit  or  logistic 
distributions  for  the  situations  cited. 
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A  form  of  asymptotic  efficiency  as  given  by  Rao  [89]  is  now 

defined: 

Let  PC-,...,*|9)  be  the  probability  density  of  the  random  variables 

X, , . . . ,x  and  define  the  random  variable 
1      n 


J  d  log  P(x^,...,x^|0) 


Z  =- ^ 5.40 

n   n        de 

A  consistent  estimator  T  of  6  is  said  to  be  efficient  if 

n 

*^It  -  6  -3(9) Z  I  ^-  0  in  probability  or  with  a  probability  of  1 
where  3  does  not  involve  the  observations. 

In  the  multiparameter  case  where  the  parameter  vector  is 
6'  =  (6^,  6^,..., 9  ),  a  vector  of  derivatives  is  defined  as 

-n   ^  n*    '  n"^ 

where         z^  =  ^  ^-^^   ,    i  =  l....,q 

1 

and  a  vector  of  deviations  of  the  estimators  from  the  true  values  as 


D'  =  (T  -  6)' 

=  (T^  -  9,,...,T^  -  6  ). 
^  n    1'    '  n    q-^ 

T     is  said  to  be  efficient   if 
-n 

yn"lD     -   BZ    I   ->  0  5.41 

' -n       ~~n' 

in  probability  or  with  probability  1,  where  6  is  a  matrix  of  constants 
that  may  depend  on  6 . 

Getting  back  to  the  original  problem  introduced  in  the  first 
paragraph,  9  is  assumed  to  be  an  efficient  estimator  of  9  in  the  sense 
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of  (5.41),  and  that  each  n. (6)  admits  continuous  partial  derivatives  of 

the  first  order  (only)  with  respect  to  6 . ,  j  =  l,...,q.  Alternatively, 

it  is  assumed  each  n.  (0)  is  a  totally  differentiable  function  of  9 

e  ,...,0  .  The  matrix  M  is  defined  such  that: 
z      q 

-1/2 
M  =  (n     311  / 36  )  of  order  (kxq)  computed  at  the  true  values  of 

0  with  rank  q. 

The  asymptotic  distribution  of 

2 


^     (n.  -  nn.)" 

2     VI       ~1  r  At 

X  =  Z.  :: 5.42 


nn. 

~1 
2  .       ^ 

is  X  (k-l-q)  where  H.  =  n.(0)  and  n.  and  n  are  the  observed  frequency  in 

cell  i  and  total  number  of  observations,  respectively.   Rao  [89]  has 

discussed  that  maximum  likelihood  estimators  of  6  for  the  multinomial 

distribution  are  consistent,  provided  they  exist.   Previous  discussions 

in  Chapter  3  and  in  the  present  chapter  have  given  conditions  for  the 

existence  of  the  maximum  likelihood  estimators. 

The  goodness  of  fit  testing  of  equation  (5.42)  would  be  adequate 
if  one  was  interested  only  in  the  overall  performance  of  the  postulated 
model  without  any  inference  on  specific  or  groups  of  elements  of  the 
parameter  estimator  0.   However,  in  urban  transportation  planning  one 
may  be  interested  in  functions  of  the  elements  of  0  in  the  computation 
of  elasticities  and  values  of  travel  time  for  different  situations.  To 
make  proper  inference  about  the  components  of  6  the  joint  distribution  6, 
which  is  tractable  only  in  large  samples,  is  needed. 

It  is  assumed  that  x^ , . . . ,x  are  identically,  independently  distri- 
buted random  variables  that  may  be  multidimensional.   P(x,e)  is  defined 
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as  the  probability  density  depending  on  a  q-dimensional  parameter 

9'  =  (6   e   . . ,6  ).   9  is  assumed  to  be  admissible  in  that  it 
^12      q 

belongs  to  a  nondegenerate  interval  of  the  q-dimensional  Euclidean 
space.  The  log  likelihood  of  n  independent  observations  is 

1(9)  =  log  p(x,e)  +  log  p(x2,e)  +  ...  +  ...iog  pCXj^,e) 

The  first  derivative  of  the  log  likelihood  function  evaluated  at 

9  with  respect  to  9.  is 
^       ~i 

31    ,   1 '   P^^l'g)  1    3  PCx^'?) 


39.     P(x,,e)     39.      •••   PCx  ,9)     39. 
1     ^  1  --^      1  ^  n  ~      1 

After  Rao  [89]  the  following  efficient  score  is  defined. 

^1=7:397  ^-'^ 

vn       1 

The  information  matrix  on  9  in  a  single  observation  x,  is  defined 


as 


^=  C^  )  =  E{-i ^  •  -i ^  T  5  44 

~    '■^rs-'     P(x,e)   36    P(x,e)   39  > 

where  E  denotes  "expectation".  Defining  the  vector  V  =  (((),  (9)  >  •  •  •  >(})  (e)) 

it  can  be  proved  [89]  that  the  asymptotic  distribution  of  V  is  q- 

variate  normal  with  mean  zero  and  dispersion  matrix  — ^ 

^  n  - 

Under  some  regularity  conditions  [89]  for  P(x  ,9)  and  denoting 
consistent  roots  of  the  equations  of  the  likelihood  equation  based  on  n 
observations  by  F,  the  following  asymptotic  equivalencies  can  be  estab- 
lished. 
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V  =  D  and  211(e)  -  1(e)]  =  D'^D,  where 

D'  =  I>^(6j-ep,...,y^(e^-e^)] 

The  equivalence  between  these  asymptotic  equivalences  and  the  Newton- 

Raphson  procedure  for  maximizing  the  log  likelihood  function  is  clear. 

It  is  recalled  here  that  11.  is  a  function  of  e  =  (e,,...,e  )  where 

1  ~     1      q 

the  vector  e  belongs  to  an  admissible  set  0.   It  is  supposed  that  the 
true  value  e°  is  an  interior  point  of  0.   Before  the  asymptotic  distri- 
bution of  the  maximum  likelihood  estimators  of  6  is  introduced  it  is 
necessary  to  make  some  assumptions  [89] .   Some  of  these  assumptions 
have  been  previously  stated  but  the  whole  set  is  given  here  for 
completeness. 

Assumption  1.1:  Given  a  6  >  0,  it  is  possible  to  find  e  such  that 

n.(e°) 


inf        InAB')  log  -^^    2L  ^ 
!e-e°l>6    '  ~  "i^^~^ 


|e-e°|>6 

where  I  e  -  e°  |  is  the  distance  between  e  and  e° .  This  assumption 

may  be  referred  to  as  a  strong  identifiability  condition,  implying 

that  outside  the  sphere  |e  -  6°  |  _<  5,  there  is  no  sequence  of 

points  9  such  that  11(9  )  ->-  Il(9°)  as  r  ^-  »,  i.e.  there  are  no 

values  of  9  remote  from  9°  but  yielding  nearly  the  same  set  of 

probabilities  11.(9°). 

Assumption  1.2:   II- (9)  ^   11.(6)  for  at  least  one  i  when  e^  B, 

which  is  a  weaker  identifiability  condition. 

Assumption  2.1:  The  f unctions  IT. (9) ,  i=l,...,k  are  continuous  in  9, 

Assumption  2.2:   The  functions  11.(9)  admit  first-order  partial 

derivatives. 
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Assumption  2.3:  The  functions  11.(6)  admit  first-order  partial 
derivatives  that  are  continuous  at  9°. 


Assumption  2.4:  The  functions  11.(0)  are  totally  different iable 
at  Q° . 

Assumption  3:   Let  the  information  matrix  ^be  non-singular  at 
0°  where 

<if      =  E   i-^ ^^  -^ l^T 

P(x,0)   30^   P(x,6)   90.^ 

q  ^3II^3n^ 
"  j=l  "j  ''r   ''s 

6*  is  defined  as  an  approximate  maximum  likelihood  estimator  if 

Lln(9*)]  >_c   Sup  L[n(6)]  5.45 

eeQ 

That  assumption  1.1  implies  0*  -^  0°  with  probability  1  as  n  ^^  is 

shown  as  follows: 

It  can  be  proved  that  [89] 

n.(6*) 

J^   P.  log  — p >■  0  over  the  admissible  set  9.   Since 

i 

log  11.(0*)  are  bounded  as  P  ->  n°, 

(p  =  (P^,...,P,^),  n°  =  (nj(0°),...,n^(0°)) 
n.(0*) 

711.(0'')  log   .~8>.  -*■  0  over  the  admissible  set  0. 
^  ~       n.(.0  J 

From  assumption  1.1  |0*  -  0°|  <_  6  with  probability  1  in  which  case 
0*  ->  9°  with  probability  1  as  6  is  arbitrarily  chosen  close  to  zero. 
Assumptions  1.1  and  2.1  imply  that  the  maximum  likelihood  estimator. 
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9,  of  6  exists  and  converges  to  6°  with  probability  1  [89].  Assumptions 
1.1  and  2.2  imply  that  the  maximum  likelihood  estimator  is  the  root  of 
the  equations 

1^  =  0,    i  =  l,...,k  5.46 

39. 

with  a  probability  of  1. 

By  choosing  6  in  such  a  way  that  the  sphere  |6  -  9° |  <_  5  in  the 
interior  of  0,  the  supremum  of  L [II (9)]  is  attained  in  an  open  interval 
as,  say  0.   If  L[n(9)]  is  differentiable  (assumption  2.2)  the  partial 
derivatives  must  vanish  at  9. 

Under  assumptions  1.2,  2.3,  and  3,  and  denoting  the  (rs)th 
element  of  the  inverse  of  ^by  i   and 

k   P.   an. 


^T  ~    I.   n.(9°)  W^  ' 

1=1  i^~  ^  r 


there  exists  a  consistent  root  9  of  the  likelihood  equation  (5.46)  v/hich 
may  not  be  a  maximum  likelihood  estimator  and 

»^l6^  -  e;  -  i""^  Zj-.-.-i'^'^Z^I  -.  0    r=l,...,q. 

with  a  probability  of  1. 

This  implies  that  the  maximum  likelihood  estimators  are  efficient 
in  the  sense  of  equation  (5.41)  and  that  their  asymptotic  distribution 
is  q-variate  normal  [89] . 

i.e.  9  -^  N  (9°,  i^^)  5.47 

n^<»    " 

where  N  denotes  a  q-variate  normal  distribution. 

q 
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All  the  discussion  so  far  on  maximum  likelihood  estimators  applies 
to  the  estimators  for  a  multinomial  logit  model  for  a  common  alterna- 
tive set,  multinomial  response  relation  and  logistic  discrimination 
estimation  problems.  That  all  the  assumptions  are  satisfied  by  the 
symmetric  and  asymmetric  logistic  distributions  defining  IT. (9)  in  the 
discussion  is  clear. 

Summary . 

The  theoretical  development  of  the  multinomial  response  relation 
procedure  and  its  parameter  estimation  are  discussed,  with  a  detailed 
presentation  on  its  specific  application  to  problems  of  interest  in 
this  research.  The  advantage  in  the  ability  of  the  method  to  include 
interaction  effects  in  developing  predictive  models  is  emphasized.   In 
order  to  consider  both  categorical  and  continuous  variables,  the  latter 
are  partitioned  into  appropriate  classes.  This  is  particularly  impor- 
tant in  destination  choice  modelling  where  most  of  the  factors  are 
qualitative.  The  application  of  the  multinomial  response  relation 
approach  to  functional  destination  choice  modelling  is  discussed  in  terms 
of  cross-classification  tables  both  for  market  segmentation  and  functional 
destination  definition.  The  logical  provision  for  making  aggregate 
forecasts  of  functional  destination  choice  implied  in  the  use  of  the 
multinomial  response  relation  approach  is  discussed  as  well  as  its 
possible  extension  to  other  travel  decisions. 

The  theory  behind  a  recently  developed  logistic  discrimination 
technique  is  presented  and  the  technique  recommended  as  the  eventual 
operational  statistical  to  be  used,  given  standardized  functional 
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destinations.   It  is  pointed  out  that  this  discrimination  technique  can 
be  readily  applied  to  travel  decisions  like  mode  and  time  of  day 
choices.  The  extension  of  the  mLxed  (or  mixed  population]  logistic 
discrimination  technique  to  the  separate  sampling  situation  is  also 
discussed,  the  latter  being  especially  recommended  for  travel  behaviour 
modelling.  This  is  to  ensure  that  one  captures  all  essential  travel 
market  segments  without  excessive  data  collection  at  unnecessary 
expense. 

Large  sample  statistical  properties  of  maximum  likelihood  estimators 
are  discussed.   It  is  pointed  out  that  it  is  only  under  large  sample 
estimation  that  one  can  take  advantage  of  asymptotic  statistical 
distributions  of  parameters  to  make  proper  statistical  inferences  and 
develop  stabler  and  more  representative  models.   Separate  sampling 
model  development  strikes  a  suitable  compromise  between  the  statistical 
large  sample  requirement  and  the  need  to  cover  all  market  segments  on 
the  one  hand  and  the  parsimony  in  minimizing  on  manpower  and  fiscal 
resource  expenditure  in  data  collection  on  the  other. 
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CHAPTER  6 

ATTITUDINAL  CONSIDERATIONS  IN 

DESTINATION  CHOICE  MODELLING 

This  chapter  discusses  the  statistical  preparation  of  successive 
categorical  data  in  which  ratings  were  assigned  to  different  factors 
that  may  influence  the  selection  of  a  general  shopping  area  destina- 
tion. Such  preparation  is  necessary  to  minimize  multicollinearity 
or  interaction  among  the  factors  prior  to  their  use  in  market  segment- 
ation in  predicting  the  choice  of  functional  destination  alternatives. 
Similar  considerations  with  respect  to  successive  category  data  in 
mode  choice  analysis  have  been  discussed  in  recent  transportation 
planning  literature  [78,  79] .  The  normality  assumptions  in  the  use 
of  the  raw  attitudinal  data  for  some  multivariate  analyses  and  classi- 
cal approaches  with  roots  in  Thurstone's  Law  of  Categorical  Judgement 
[85,  86]  do  not  appear  to  hold  very  well  for  the  attitudinal  data 
available  for  this  research.  Since  there  is  no  guarantee  that  furth- 
er data  collection  would  provide  a  better  data  base  to  fit  classical 
approaches,  and  considering  the  immense  data  collection  cost  and  the 
time  lag  between  successive  data  collection  efforts  in  transportation 
planning,  it  is  necessary  that  other  statistical  approaches  be  used 
to  derive  the  best  information  from  available  data.  With  the  quali- 
tative nature  of  the  variables  there  is  heavy  reliance  on  contingency 
table  techniques  in  this  chapter. 
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Description  of  Available  Successive  Category  Data 

The  preception  by  individuals  of  factors  that  could  affect  the 
choice  of  shopping  destinations  was  solicited  by  Kannel  [10]  through 
a  7-point  successive  category  rating  scale  [Figure  6.1].  The  higher 
the  rating  for  a  factor  the  higher  its  level  of  importance  to  a  given 
individual.  Table  6.1  shows  a  listing  of  the  factors  and  their  mean 
ratings  for  the  total  sample  used  in  Kannel's  work  [10].  The  ratings 
give  the  impression  that  the  choice  of  shopping  destinations  is  more 
influenced  by  the  characteristics  of  the  shopping  areas  than  by  the 
characteristics  of  the  transportation  system.  Factors  related  to 
shop  characteristics  that  received  high  ratings  were  "a  wide  selection 
of  goods",  and  "appearance  of  the  store  and  surroundings".  Out  of  15 
factors,  the  descriptors  referring  to  distance,  travel  time  and 
travel  cost  were  ranked  in  positions  11,  12,  and  13  respectively. 
Since  most  shopping  area  developments  are  tied  up  with  parking  the 
factors  relating  to  the  ease  of  parking  and  the  cost  of  parking  were 
considered  as  part  of  the  shopping  area  characteristics.  These  two 
factors  were  rated  second  and  tenth,  respectively.  The  ability  to 
quickly  find  a  parking  spot  was  ranked  fifth. 

As  is  indicated  later  in  this  chapter,  the  factors  in  Table  6.1 
are  not  all  independent.  The  average  rating  is  of  no  use  in  pre- 
dicting shop  destination  choice  behavior  unless  one  has  a  very  large 
sample  for  which  the  rating  scales  for  different  subsamples  would 
differ  substantially  in  which  case  the  effect  of  the  different  scales 
can  be  considered  in  model  development.  The  sample  size  available 
to  this  research  was  limited  and  it  was  unlikely  to  find  much  difference 
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When  you  select  a  place  to  go  shopping  how  important  is  it: 

Not 

At  All 

Important 


A.  to  go  in  an  area  of  town  you  are 
familiar  with? 

B.  to  go  where  you  can  walk  around 
and  shop  at  several  stores? 

C.  to  go  where  there  are  many  employees 
to  help  you? 

D.  to  go  where  there  is  a  wide  selection 
of  items  to  choose  from? 

E.  to  go  where  you  can  make  the  trip 
in  the  shortest  time? 

F.  to  go  where  it  costs  less  to  park? 

G.  to  go  where  many  other  people  do 
their  shopping? 

H.   to  go  to  places  where  the  roads 
are  not  congested? 

I.   to  go  where  you  don't  have  to  walk 
far  after  you  get  off  the  bus  or 
out  of  the  car? 

J.   to  go  where  you  can  easily  return 
home? 

K.  to  go  where  you  can  ride  to 

several  other  stores  at  different 
locations? 

L.  To  go  where  you  can  quickly  find 
a  place  to  park? 

M.   to  go  where  the  cost  of  travel  is 
less? 

N.  to  shop  as  close  to  home  as 
possible? 

0.   to  go  where  the  stores  and 
surroundings  have  a  pleasant 
appearance? 


2 
2 
2 
2 

2 

2 

2 
2 

2 
2 


3 

3 

3 

3 

3 
3 

3 

3 

3 
3 


3 
3 
3 


4 

4 

4 

4 

4 
4 

4 

4 

4 
4 


4 
4 
4 


Of 
Greatest 
Importance 


5 
5 
5 

5 

5 
5 

5 

5 

5 
5 


6 
6 
6 
6 

6 

6 

6 
6 

6 
6 


6 
6 
6 


7 

7 

7 

7 

7 
7 

7 

7 

7 
7 


7 
7 
7 


FIGURE  6.1.   SUCCESSIVE  CATEGORY  QUESTIONNAIRE  ON  SHOPPING  DESTINATION  CHOICE. 

(After  Kannel  [10]) 
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Table  6.1.  Attitudinal  Ratings  of  Factors  Important  in  Selection 
of  a  Shopping  Trip  Destination. 
C\fter  Kannel  [10]} 


Generalized  Descriptor 


Average 
Rating* 


Rank 
Order 


Familiar  area 

Shop  at  several  stores 

Employees  to  help  you 

Wide  selection 

Shortest  trip  time 

Parking  cost 

Go  where  others  shop 

Roads  not  congested 

Walking  distance  from  vehicle 

Easy  to  return  home 

Ride  to  several  stores 

Quickly  find  place  to  park 

Travel  cost 

Distance 

Pleasant  appearance 


5.07 
5.61 
5.01 
6.25 
4.74 
4.81 
3.31 
5.30 
5.72 
5.67 
4.36 
5.61 
4.74 
4.79 
5.71 


8 

5  1/2 

9 

1 
13 
10 
15 

7 

2 

4 
14 

5  1/2 
12 
11 

3 


^Maximum  rating  =  7.0 
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in  rating  scales  for  subgroups  within  this  small  sample  for  the 
specially  selected  group  of  relatively  stable  households.  The 
results  of  a  Kolmogorov-Srairnov  test  for  differences  in  shopping 
destination  choice  influencing  factor  perceptions  by  subgroups  as 
determined  by  Kannel  gave  support  to  this.    A  way  to  use  the  atti- 
tudinal  data  is  to  classify  individuals  Cnot;  groups}  according  to 
their  response  patterns  on  the  successive  category  rating  form.  With 
a  substantial  sample  size  a  3-  or  4-way  classification  could  be 
accomplished.   It  is,  however,  impractical  to  consider  anything  be- 
yond a  2-way  classification,  considering  the  limited  sample  size 
available  and  the  fact  that  the  factors  are  not  independent.  Seven 
factors  were  therefore  selected,  considering  the  average  rankings  in 
Table  6.1,  their  predominance  in  previous  traditional  destination 
choice  models  and  the  possibility  that  two  or  more  factors  semanti- 
cal ly  could  be  reduced  to  one  descriptor.  The  factors  selected  for 
later  detailed  work  are  listed  in  Table  6,2. 

Table  6.2.  Shopping  Destination  Choice  Factors  Considered  in  this 
Research 

1.  Many  Employees  in  Shopping  Establishment 

2.  Wide  Selection  of  Goods 

3.  Shopping  Destination  Requiring  Shortest  Travel  Time 

4.  Uncongested  Approach  Roads  to  Shopping  Destination 

5.  Desire  to  Visit  Several  Stores  at  Different  Locations 

6.  The  Ability  to  Quickly  Find  a  Parking  Space 

7.  A  Destination  Closest  to  Home 
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Some  Approaches  for  Successive  Category  Data 

The  discriminal  process  (not  directly  observable  but  playing  a 
central  role  in  judgemental  models)  associated  with  a  stimulus  xj  will 
be  represented  as  a  random  variable  Vj  [86]  .  A  value  of  Vj^  for  a 
randomly  sampled  subject,  t,  is  considered  to  be  composed  of  a  fixed 
component  y.,  specific  to  x^ ,  but  common  to  all  subjects,  and  e^^, 
a  random  component,  i.e.  v-^  =  11^+  ej^.  For  successive  categories 
a  second  discriminal  process  is  postulated  which  pertains  to  the 
perception  of  a  given  point  on  the  rating  form: 

^kt  =  "^k  +  "t  *  ^kt 
X.  is  a  fixed  component  associated  with  point  k  on  the  continum 

represented  by  the  rating  form.  II  is  an  individual  difference 
component  that  depends  on  the  interpretation  of  the  rating  form  for 
a  particular  subject  t.  e^^   is  a  random  component  allowing  for  vari- 
ability of  interpretations  over  subject,  t,  of  point  k  on  the  rating 
form. 

The  joint  distribution  of  e.  and  11^  +  ej^^  is  assumed  to  be 

■"^  2       2 

bivariate  normal  with  means  zero  and  variances  6 .   and  y,  and  inter- 

'        2        2 
correlation  zero.  It  is  further  assumed  [86]  that  y    =y     for  all 

values  of  k. 

Subject  i  will  rate  stimulus  j  at  or  below  point  k  if 

Vjki  =  Vji  -  ^ki  =  ^j  -  ^k  *  ^ji  -  ^kiio 

v.,  is  normally  distributed  with  means  y .  -  Ti,  and  variance 
jk  '  3  '^ 

2    2     2 
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With  the  normality  assumption,  the  probability  that  a  randomly 
selected  subject  i  will  judge  stimulus  x^  at  or  below  point  k  on  the 
rating  form  is  given  by 

Pjki  =  Z=Z        /    ^^PC  -  2   ^)dy  6.1 

a,/^n    _„  2 

J  aj 

where  y  is  the  normal  deviate  corresponding  to  a  unit  normal  distribu- 
tion. 

Bock  and  Jones  [86]  have  discussed  graphical  and  minimum  Normit 
^2  procedures  for  estimating  Yv>  b-  and  p.  in  equation  6.1.  To  simpli- 
fy the  normit  y^   procedure  for  successive-category  data,  it  is  necess- 
ary to  assume  that  the  responses  to  different  objects  or  stimuli  for 
the  sample  in  question  are  independently  distributed.  Such  independ- 
ence is  not  strictly  true  since  the  same  individuals  are  responding  to 
all  the  objects.  An  extreme  case  is  when  objects  are  presented  to 
all  subjects  in  the  same  order  and  two  similar  objects  follow  each 
other.   In  such  a  case  the  distributions  of  the  response  frequencies 
will  be  similar.  If  the  order  in  which  the  objects  are  presented 
to  the  individuals  is  randomized  the  dependency  in  response  distri- 
butions will  be  minimized.   If  the  correlation  of  the  error  term  is 
roughly  constant  and  is  within  some  plausible  bounds  the  expressions 
for  the  expected  sums  of  squares  and  sampling  variance  of  parameter 
estimators  can  be  generalized  in  the  correlation  case.  The  analysis 
can  easily  get  unwieldy  in  such  a  situation.  One  could  also  investi- 
gate the  need  to  use  the  procedure  that  always  considers  such 
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correlation  as  a  routine  measure  in  cases  where  the  correlation  may 
be  statistically  different  from  zero  but  judged  to  be  of  a  low  order. 

The  correlation  matrix  for  the  response  frequencies  to  the 
factors  in  Table  6.2  is  shown  in  Table  6,3.  Though  most  of  the 
correlations  are  significantly  different  from  zero  at  the  1-percent 
level  very  few  of  them  have  an  absolute  value  that  exceeds  0.4  in 
which  case  the  lack  of  independence  may  not  completely  vitiate  the 
analysis.  Besides,  the  independence  assumption,  the  normality 
assumption  implied  in  the  above  discussion  must  be  verified  for  the 
available  data.  Figure  6.2  shows  the  frequency  distributions  for 
the  response  categories  for  the  seven  factors  under  consideration. 
The  Kolmogorov-Smirnov  normality  tests  for  the  response  frequencies 
reject  the  normality  assumption  for  all  cases.  In  view  of  the 
nature  of  the  frequency  distributions  in  pigure  6. 2, some  of  the  rating 
categories  were  combined  after  which  tests  of  pairwise  independence 
in  contingency  tables  were  conducted  for  two  different  sets  of 
3x3  and  a  set  of  2x2  contingency  tables  as  shown  in  Tables  6,4a 
to  6.6.  The  test  of  independence  is  based  on  the  goodness-of-fit 
X  value  for  main  effects  models.   Except  for  a  couple  of  cases, 
the  assumption  of  pairwise  independence  is  rejected  at  the  1-percent 
level  in  all  cases  for  the  3x3  tables.  In  the  2x2  tables  the 
assumption  of  independence  is  rejected  in  all  but  two  cases.  The 
effect  of  the  lack  of  independence  will  be  investigated  in  a 
later  section  assuming  a  logistic  distribution  for  the  response 
categories  for  selected  combinations  of  factors.  The  importance  of 
this  analysis  lies  in  the  fact  that  one  is  interested  in  selecting 
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Table   6-3 


Correlation     Matrix     For    7   Shopping 
Destination    Choice    Factors. 


1 
Many    Employees 

2 
Wide   Selection 

3 
Shortest   Time 

4 
Uncongested   Roads 

5 
Several   Stores 

6 
Quick   Parking 

7 
Close    To    Home 

1 

— 

.216 

.127 

.205 

.260 

.108 

.188 

2 

— 

.201 

.244 

.182 

.262 

.173 

3 

— 

.262 

.176 

.235 

.405 

4 

— 

.310 

.432 

.304 

5 

— 

.314 

.296 

6 

• 

— 

.266 

7 

— 
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FIGURE    6.2 
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RESPONSE     CATEGORIES    OF    SEVEN 
DESTINATION    CHOICE     FACTORS 
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the  most  effective  shopping  destination  choice  influencing  factors 
for  model  development  in  Qiapter  7  and  finding  out  if  interactions 
among  the  attitudinal  factors  need  to  be  considered.  The  method  of 
analysis  to  be  used  will  be  a  special  case  of  the  multinomial  response 
relation  model  that  can  be  used  for  a  factorial  experiment  or  a  cross- 
classification  problem.  Since  a  linear  model  for  the  effects  of 
cross-classification  of  subjects  according  to  their  response  categor- 
ies to  objects  is  the  same  as  that  for  a  factorial  design  on  the 
objects,  the  theoretical  presentation  given  by  Bock  and  Jones  [86] 
for  a  factorial  experiment,  as  briefly  discussed  below,  is  pertin- 
ent. 

The  incorporation  of  the  linear  model  of  a  factorial  design 
into  the  measurement  model  for  the  method  of  successive  intervals, 
assumes  equal  discriminal  dispersions  (i.e.  6.  =  6  for  all  j). 

A  factorial  experiment  is  assumed  in  which  the  response  to 
the  treatment  combinations  consists  of  independent  judgements  obtain- 
ed from  a  rating  form.  Assuming  the  distribution  of  judgement  into 
n+1  categories  follows  Thurstone's  law  of  categorical  judgement,  the 
probability  that  a  response  to  the  h   treatment  combination  (or 
"cell")  will  fall  in  or  below  the  k   successive  category  is: 


Xhk  =  Cxk  -  yj^l  /<Sj, 


T,  is  a  parameter  specifying  the  upper  boundary  of  an  interval 
associated  with  the  k   category  of  the  rating  scale,  y,  is  a  para- 
meter that  indicates  the  effect  of  the  h^"  treatment.   6,  is  a  scale 
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parameter  for  the  h   treatment  combination  and  g(2)  is  a  probabil- 
ity density  function.  We  assume  the  logistic  form  for  g(z)  as 
follows: 

g(z)  =  J  sech  -^ 

from  which  Pj^j^  =  1/(1.  -Y^^) 

where      Y^^  =   log  [Phk/Cl-Phk)] 

It  is  assumed  that  ^^  is  sufficiently  homogeneous  in  which  case 
6=6  for  all  h.  Without  loss  of  generality  6  can  be  taken  to  be 
1  by  a  suitable  choice  of  scaling  unit.   The  linear  model  for  a  treat- 
ment combinations  is  then  expressed  as: 

M  =     A       5 
r\Stl     nion  mil 

If  A  is  singular  of  rank  r  <  m,r  ^  n  a  reparameterization  of  the 

model  in  terms  of  r  linear  estimable  functions  of  the  parameters  in 

the  form  of  6  =  L  £ .  becomes  necessary.   This  has  been  previously 

rxm 
discussed  in  Chapter  5  in  connection  with  the  multinomial  response 

relation  model.   It  follows  that 

y  =  ^  e 

nxr  rxl 

where  jj=k'e  h  =  l,  2,...,n 

where  k'  is  the  row  of  JC  corresponding  to  treatment  combination  h. 
-h 

The  response  model  in  equation  6.2  becomes: 


^^       2  _„  sech^Z/2dz  6.3 
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The  problem  then  becomes  that  of  estimating  x  and  6  from  independent 
judgements  randomly  sampled  from  populations  defined  by  equation 
6.3.   Bock  and  Jones  [86]  have  discussed  a  logit  X  solution  for  the 
parameter  estimation  involved.  The  procedure  that  was  used  in 
subsequent  work  in  this  chapterwas  maximum  likelihood  estimation  based 
on  the  multinomial  response  relation  model,  using  a  symmetrical  log- 
istic distribution.   Bock  [66]  has  discussed  the  connection  between 
the  asymmetrical  and  symmetrical  distribution  situations  for  binary 
choice  and  the  extension  of  the  symmetrical  case  for  a  multiple-choice 
problem  as  discussed  in  Chapter  5. 

The  Reproduction  of  Attitudinal  Orientation  Patterns 

In  order  to  make  attitudinal  information  obtained  from  a  survey 
useful  for  present  day  or  short-terra  forecasting,  one  has  to  be  able 
to  reproduce  the  pattern  of  attitudinal  orientation  obtained  from 
the  survey  through  some  independent  variables  or  factors  or  with  a 
suitable  alternative  technique.  Though  it  may  not  yet  be  possible 
to  use  attitudinal  information  for  long-range  forecasting,  the  con- 
sistent good  performance  of  some  attitudinal  factors  that  are  direct- 
ly associated  with  some  objectively  measurable  variables  or  factors 
puts  the  transportation  planner  in  a  position  to  improve  future 
data  collection.  The  need  for  such  improved  future  data  collection 
in  destination  choice  modelling  is  becoming  increasingly  clearer 
since  the  currently  available  zonal  retail  floor  area  and  retail 
employment  attraction  measures  are  unable  to  reveal  all  the  intricac- 
ies in  destination  choice  behaviour.  Whatever  the  technique  used 
for  reproducing  attitudinal  orientation  patterns,  one  has  to  know 
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how  the  factors  act  jointly  or  in  statistical  terms,  one  would  be 
interested  in  the  joint  distribution  of  the  factors.   In  the  case  of 
two-way  associations  among  the  factors,  the  correlation  coefficient 
is  often  used  to  determine  the  degree  the  two  factors  concerned 
might  confound  each  other  in  the  same  model.  Quite  often  there  can 
be  some  2-way  or  3-way  interactions  among  some  of  the  factors  that 
are  more  conveniently  measured.  Since  such  factors  are  not  easily 
dispensed  with,  the  interaction  effects  must  be  included  in  model 
building.  Assuming  the  availability  of  independent  destination  choice 
attitudinal  factors,  the  question  of  what  further  independent  vari- 
ables or  factors  can  be  used  to  predict  the  observed  attitudinal 
orientation  patterns  still  prevails.  Little  success  has  been  exper- 
ienced with  the  use  of  scio-economic  variables  for  this  purpose. 
Extensive  research  is  needed,  with  greater  demands  for  data  collec- 
tion and  complicated  analysis  in  order  to  approach  the  threshold  of 
the  problem.  The  likely  danger  is  that  the  problem  can  become  com- 
plicated, making  the  analyst  lose  sight  of  the  nature  of  the  original 
transportation  planning  problem — the  prediction  of  aggregate  flows 
of  primary  traffic  on  a  transportation  network.  Considering  the 
problem  in  the  context  of  an  experimental  design,  as  has  been 
previously  argued,  the  reproduction  of  attitudinal  orientation  can 
be  accomplished  at  a  level  quite  adequate  for  the  current  level  of 
data  availability,  allowing  for  interactions  among  the  attitudinal 
factors.  This  is  a  direct  way  of  incorporating  attitudinal  factors 
in  travel  demand  modelling  rather  than  going  through  independent 
predictive  factors  that  are  extremely  difficult  to  identify. 
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The  procedure  used  in  this  research  to  reproduce  attitudinal 
orientation  patterns  uses  cross-classifications  with  respect  to 
categories  of  successive  category  rating  ranges.  It  must  be  pointed 
out  that  a  crossed  design  is  not  the  only  one  that  can  be  used.  Nest- 
ing or  other  designs  can  also  be  used  provided  that  they  are  planned 
for  in  the  original  data  collection. 

The  multinomial  response  relation  technique,  the  theory  of 
which  was  presented  in  Chapter  5,  is  sufficiently  general  to  handle 
all  the  design  situations.  Attempts  at  reproducing  attitudinal  pat- 
terns using  two  different  sets  of  3x3  and  one  set  of  2x2  cross-classi- 
fication tables  will  be  discussed.  The  patterns  for  those  tables  are 
shown  in  Figure  6J5.   Since  a  more  detailed  discussion  of  the  empiri- 
cal application  of  the  multinomial  response  relation  procedure  has 
been  discussed  in  Chapter  5  (with  detailed  examples  in  Appendix  A) 
only  the  essential  points  for  the  interpretation  of  models  fitting 
the  observed  attitudinal  patterns  for  348  households  will  be  present- 
ed in  the  remainder  of  this  chapter.  Main  effect  parameters  are  all 
relative  to  the  last  category  of  classification.  In  the  3x3  tables, 
parameter  estimates  for  each  way  of  classification  are  for  category 
1  relative  to  category  3  and  for  category  2  relative  to  category  3. 
In  the  2x2  tables,  the  category  1  effect  is  estimated  relative  to 
category  2  for  each  way  of  classification.  An  I, J  interaction  in  a 
3x3  table  is,  therefore,  the  interaction  of  level  I  relative  to 
level  3  with  level  J  relative  to  level  3. 

When  all  interactions  are  negligible,  the  model  fits  the  data 
well  and  the  two  factors  can  be  considered  to  be  independent. 
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A  main  effects  model,  therefore,  is  expected  to  reproduce  the  attitu- 

2 
dinal  orientation  pattern  well.  A  significant  goodness-of-fit  X 

value  for  a  given  model  indicates  bad  reproducibility  of  the  observed 

attitudinal  orientation  pattern.  Each  interaction  is  associated  with 

1  degree  of  freedom  and  each  interaction  fitted,  if  significant,  must 

2 
reduce  the  goodness-of-fit  X  by  at  least  an  amount  corresponding  to 

2 
the  X  value  for  1  degree  of  freedom  at  a  given  significance  level. 

For  the  purposes  of  the  work  being  done  in  this  research,  if  the 
goodness-of-fit  X  is  significant  for  a  model  for  reproducing  the 
attitudinal  orientation  pattern  due  to  two  factors,  the  factors  con- 
cerned cannot  be  considered  simultaneously  in  any  model  of  travel 
behavior.   In  such  a  situation  there  usually  is  a  systematic  way  of 
association  between  the  two  factors. 

Introduction  of  Tables  that  Present  Models  for  Predicting  Attitudinal 
Orientation  Patterns 

The  presentation  of  the  tables  discussed  in  the  remaining  sec- 
tions of  this  chapter  is  to  enable  the  reader  to  get  the  essential 
objectives  of  the  tables  before  they  are  discussed  in  detail.  Sever- 
al of  the  tables  refer  to  Case  I,  Case  II  or  Case  III  models  which 
are  models  based  on  cross-classification  designs  presented  in  Figure 

6.3. 

2 
Tables  6.4a  to  6.8  give  goodness-of-fit  X  values  associated 

with  models  for  predicting  response  patterns  for  pairings  of 

attitudinal  factors  in  2-way  contingency  tables.  Cases  I  and  II 

models  are  for  3x3  contingency  tables.  Case  III  models  refer  to 

2x2  contingency  tables.  For  Case  I  models  both  tables  6.4a  and 
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9 

6.4b  show  four  layers  of  goodness-of-fit  X  values.   In  Table  6.4a 
the  layers,  starting  from  the  top,  represent  models  with  the  follow- 
ing effects: 

2 

1.  Main  Effects  (Pearsonian  X  value  computed) 

2.  Main  Effects  (Likelihood  Ratio  X^  value  computed) 

3.  Main  Effect  +  1,1  Interaction 

4.  Main  Effects  +  1,2  Interaction 

The  four  layers  of  Likelihood  Ratio  X^  values  in  Table  6.4b,  from 
the  top,  represent  models  with  the  following  effects: 

1.  Main  Effects 

2.  Main  Effects  +  2,1  Interaction 

3.  Main  Effects  +  2,2  Interaction 

4.  Main  Effects  +  1,1  +  2,2  Interactions 

The  above  description  of  the  layers  of  goodness-of-fit  X^ 
values  for  Case  I  models  in  Tables  6.4a  and  6.4b  apply  to  the  corres- 
ponding X^  values  for  Case  II  models  displayed  in  Tables  6.5a  and 
6.5b  respectively. 

The  two  layers  of  goodness-of-fit  X  values  for  Case  III  models, 
given  in  Table  6.6  represent  Pearsonian  and  Likelihood  Ratio  X^ 
values  respectively. 

Tables  6.7  and  6.8  show  reductions  in  goodness-of-fit  X^  values 
achieved,  compared  to  the  main  effects  model,  by  considering  1,1 
and  2,2  interaction  effects  separately  in  Case  I  and  Case. II  model 
development.  The  entries  in  Tables  6.7  and  6.8  were  obtained  from 
appropriate  subtractions  in  Tables  6.4a  and  6.5a.  for  reductions 
caused  by  the  1,1  interaction  and  in  Tables  6.4b  and  6.5b  for 
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reductions  caused  by  the  2,2  interaction.  Each  X^  value  in  Tables  6.7 
and  6.8  has  one  degree  of  freedom. 

Tables  6.9a  to  6.9c  present  Case  III  attitudinal  orientation 
pattern  prediction  models  that  displayed  an  insignificant  lack  of  fit. 
Tables  6.10  and  6.11  are  corresponding  tables  for  main  effects  Case  I 
and  Case  II  models,  respectively. 

Tables  6.12a  to  6.12e  present  five  attitudinal  orientation 

pattern  prediction  models  that  achieved  an  improved  goodness-of-fit 

2 
X  value  as  a  result  of  the  inclusion  of  a  1,1  interaction  in  Case  I 

prediction  models.  Tables  6,13a  to  6.13d  present  corresponding 

Case  II  models  that  included  the  1,1  interaction  effect. 

Tables  6.14a  and  6.14b  show  two  Case  II  models  that  obtained  a 
significant  improved  goodness-of-fit  with  the  inclusion  of  main 
effects  and  2,2  interactions  in  the  predictive  models  developed. 

Tables  6.15a  to  6.15e  present  results  for  six  Case  I  models  that 
consider  main  effects  +  1,1  and  2,2  interactions  in  the  development 
of  models  to  predict  the  attitudinal  orientation  patterns  in  3x3 
contingency  tables.  Tables  6.16a  to  6.16J  summarize  the  results  for 
eight  corresponding  predictive  models  that  consider  similar  effects 
for  Case  II  3x3  contingency  tables.  It  is  noticed  that  the  models 
with  main  effects  +  1,1  and  2,2  interactions  have  very  low  goodness- 
of-fit  X^  values  compared  to  their  equivalent  main  effects  models. 

In  all  the  attitudinal  orientation  patterns  prediction  models 
discussed  above,  row  parameters  refer  to  the  effects  of  the  row  way 
of  classification.  The  column  parameters  correspond  to  the  effects 
of  the  column  way  of  classification.   The  ways  of  classification  in 
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the  above  models  refer  to  the  attitudinal  factors  that  define  corres- 
ponding contingency  tables. 

Main  Effects  Models 


Tables  6.4a  and  6.5a  show  that  no  pair  of  factors  produce  an 
attitudinal  orientation  pattern  that  can  be  reproduced  effectively 
with  a  main  effects  model.  The  combination  of  "Many  Employees"  and 
"Shorter  Time"  barely  misses  being  a  good  model  for  Case  I  in  Table 

6.4a  and  produces  a  good  model  for  Case  II  in  Table  6.5a.  The 

2  9 

Pearsonian  X  and  Likelihood  Ratio  (L.R.)  X'^  values  are  expected  to 

be  close.  The  presence  of  both  is  a  checking  device.  The  Case  III 
2x2  model  in  Table  6.6  gives  a  good  fit  for  "Many  Employees"  by 
"Wide  Selection,"  "Wide  Selection  by  "Shorter  Time"  and  "Wide 
Selection"  by  "Uncongested  Roads."  With  Case  III  the  categories 
have  been  combined  so  much  that  the  effectiveness  of  the  joint  dis- 
tribution between  a  pair  of  factors  is  likely  to  have  been  reduced 
in  the  process  of  combining  categories  in  the  various  ways  of  class- 
ification. Not  much  weight  need  be  attached  to  the  results  for 
Case  III  except  for  the  fact  that  the  independent  classifications 
provide  a  2x2  representation  of  the  data  which  is  useful  for  illus- 
trating the  need  for  a  2-way  classification  for  the  small  sample 
size  available.  Cases  I  and  II  in  Tables  6.4a  and  6.5a  respectively 
show  that  except  for  the  pairing  of  "Many  Employees"  with  "Shorter 
Time,"  no  pair  of  attitudinal  factors  can  be  considered  as  acting 
independently.  The  observed  and  predicted  frequency  distributions 
for  the  classification  tables  discussed  in  this  Section  are  indicated 
in  Tables  6.9a  to  6.9c  and  Tables  6.10  and  6.11. 
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X  Values  In  2-Way  Tables  -  Case  I 


Many 
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31.19 
33.95* 
29.40 
27.53* 


10.01. 
9.59 
3.38 
9.07 
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26.02 

9.30 

20.44* 


58.08^ 
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24.61^ 
25.36 
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12.21** 
12.20** 

8.41* 
10.04* 


Wide 
Selection 


14.90* 

13.55* 

4.00 

12.81* 


31.83** 17.98' 


25.73 
12. i9* 
24.83" 


20.00' 
17.80* 
14.07* 


51.24** 
36.11** 
14.29 
35.57** 


77.41** 
72.26** 
32.59** 
69.71** 


3 

ShorteHt 
Time 


34.88** 
35.73** 
25.86** 
34.34** 


25.92 
24.83* 
15.46* 
22.61* 


21.11** 
21.43** 
15.95** 
20.65** 


66.21** 
63.06** 
40.66** 


DncongesCed 
Roads 


49.52** 
47.33** 
17.93** 


47.32**   62.70* 


43.23  * 
40.91** 
19.48** 
40.66** 


Several 
Stores 


40.06** 
41.79** 
27.68** 
38.52** 


46.55** 
45.09** 
24.83** 
43.81** 


28.38** 
26.92** 
11.07** 
25.70** 


6 

Quick 
Parking 


Close  to 
Home 


Significance  Level  <  .01 
.01  <  Significance  Level  <  .05 


d.f.is  as  indicated  below  unless  other- 
wise shovn  in  parenthesis 


layers  from  top 


1.  Pearsonian 


2.  L-R- 


L-R'y'^ 
L-R--:2 


Main  Effects  (4  d.f.) 


Main  Effects  (4  d.f.) 

Main  Effects  +  1,1  Interaction 

Main  Effects  +  1,2  Interaction 


(3  d.f.) 
(3  d.f.) 


Table  6.4b     )^'-Value8  In  2-Way  Tables'-  Case  I 
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Many 
Employees 


33.95 
28.66 
25.16" 
21.03 


** 


** 


9.59 
9.40* 
7.32 
2.07 


26.02 
25.34' 
16.40** 
2.06 


55.88 
54.20* 
32.66* 
18.93 


** 


25.36 
23.64* 
21.35 
13.21* 


** 


22.71 
20.44' 
12.15** 
2.00 


inr 
** 


25.73** 

21.49** 

13.28** 

0.75 


36.11** 

35.77** 

22.53** 

1.44 


12.20** 

12.20** 

10.59* 

6.96* 


Wide 
Selection 


13.55 

12.59 

12.75 

3.41 


** 


20.00** 
19.77** 
18.38** 
16.34** 


** 


Shortest 
Tine 


35.73' 
35.47 


** 
** 


26.90 
19.01** 


24.83"" 

23.74** 

11.55** 

5.96 


21.43** 
17.21** 
18.72** 
13.83** 


72.26 
68.41 
46.31 
15.49 


** 


** 


Uncongested 
Roads 


47.33' 
42,39 
37.82' 
12.38 


** 


** 


63.06 
53.52 
54.98* 
34.08* 


40.91"" 
39.19** 
31.48** 
12.84** 


Tsr- 


5 

Several 
Stores 


41.79 
39.71 
32.79 
20.67 


** 


45.09 
44.92* 
32. 99^ 
17.67 


** 


6 
Quick 
Parking 


26.92** 

22.27** 

23.20** 

8.42** 


Close  to 
Hone 


** 


Significance  Level  <  .01 
.01  <  Significance  Level  <  .05 


d'f  Is  as  Indicated  below  unless 
otherwise  shown  In  parenthesis 

layers  from  top 


1. 

L.R.^Z  _  Main  Effects 

(4  d.f.) 

2. 

L'R'X^  -  Main  Effects 
action  (3d.f.) 

+2,1  Inter 

3. 

L-R-x  -  Main  Effects 
action  (3d.f.) 

+2,2  Inter 

4. 

L-R-x^  -  Main  Effects 

+  1,1  +  2,2 

Interactions  (2  d.f.) 


Table  6.5a  x    Values  In  2-Way  Tables*-  Case  II  ,  Main  Effects 
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.3 

■H    U 

3S  6 

s 

•  1) 

1 

Many 
Employees 

24.16** 
25.12** 
23.28** 
25.11** 

8.61 
8.26 
3.19 
7.82 

14.93** 
14.33** 

5.04 
14.10** 

46.68** 
48.36** 
21.15** 
48.30** 

10.82* 

10.44* 

5.79 

8.17* 

14.69** 
14.24** 

14.14 

2 
Vide 
Selection 



16.53** 
14.68** 
10.89** 
14.64** 

18.23** 
17.56** 
13.85** 
15.96** 

26.71** 
32.76**1 
17.60**( 
27.21**( 

35.84** 
[3)29.06** 
[2)12.74** 
[2)28.97** 

19.32** 
19.58** 
13.78** 
18.54** 

3 
Shortest 
Time 



37.64"* 
35.47** 
22.57** 
28.95** 

20.62** 
20.62** 

5.62 
19.63** 

21.56"* 
23.11** 
18.38** 
21.94 

51.97*" 
52.08** 
20.88** 
51.99** 

4 
Uncongested 
Roads 

31.61** 
32.85** 
13.16** 
32.57** 

56.95"" 
54.67** 
27.75** 
54.16** 

37.86** 
37.77** 

9.35* 
37.75** 

5 
Several 
Stores 



32.58** 
35.87** 
14.55** 
34.46** 

42.14** 
41.54** 
26.32** 
41.10** 

6 
Quick 
Parking 



22.40* 
22.36 

7 

Close  to                                                                                                                                         

Home 

** 


significance  Level  <  .01 
.01  <  significance  Level  <  .05 


d'f  Is  as  indicated  below  unless 
otherwise  shown  in  parenthesis 

Layers  from  top 

1.  Pearsonian  x '^  -  Main  Effects  (4  d.f.) 

2.  L-R-X^  -  Main  Effects  (4  d.f.) 

3.  L*R0(2  -  Main  Effects  +  1,1  inter 
action  (3  d.f.) 

4.  L'R^X   -  Main  Effects  +1,2  Inter- 
action (  3  d.f.) 


Table  6.5b  x  "  Values  in  2-Way  Tablea*-  Case  II 
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I 

o 

V   « 

•a  iH 


It 


U    « 
CO   H 


« 

a 

4) 

e  a 

O  TS 

u  a 
a  o 


>  o 

M   CO 


•H     k< 


«l 
(S    1) 


Many 
Employees 


25.12 
22.56' 
15.27* 
13.76* 


** 


8.26 
7.90 
5.35 
1.16 


14.33 

13.93* 

10.43 

2.22 


48.36** 
48.23** 

21.04 


10.44 

10.40 

6.32 

2.25 


14.24 

13.72J 

10.25 

4.97 


** 


17.56**        32.76**(3)29.06**        19.58 


Wide 
Selection: 


14.68" 

13.09* 

5.39 

2.07 


15.29' 

10.36* 

6.97* 


N.C. 


27.87' 


31.98**(2)18.10** 


17.10**(1) 


2.42 


15.47 
12.99 
7.55* 


** 


Shortest 
Time 


35.47** 
33.14** 
26.76** 
12.31** 


20.62' 

20.59* 

19.19* 

5.28 


23.11' 
16.81* 
20.58* 
16.32* 


52.08** 
51.85** 
47.90** 
19.04** 


** 


4 

Dncongested 
Roads 


32.85 

32.84* 

31.53* 
12.77* 


inr 


54.67 
51.63* 
44.67* 
19.54* 


37.77 

37.12 

35.10* 

8.07* 


mr 


5 
Several 
Stores 


35.87 

28.95' 

21.83 

2.44 


** 


41.54 
41.06 
32.61* 
20.48* 


** 


** 


6 

Quick 
Parking 


22.40 

19.91 

14.94* 

4.02 


Close  to 
Home 


** 


Significance  Level  <  .01 


.01  <  Significance  Level  <  .05 
N.C.  -  No  Convergence 


d*f  is  as  indicated  below  unless 
otherwise  shown  in  parenthesis 


^Layers  from  top 
1.  L-R-X'^  -  Main  Effects 


2.  L-R-X-^ 


(4  d.f.) 
-  Main  Effects  +  2,1  Inter- 


action (3  d.f.) 

3.  L-R-X^  -  Main  Effects  +  2,2  Inter- 
action (3  d.f.) 

4.  L-R-X^  -  Main  Effects  +  1,1  +  2,2 
Interactions  (2  d.f.) 
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Table  6.6  X^  "  Values 
Cross-Class 

:  in  2-Way 
ilfication, 

Tables,  Main  Effects  Only*-  Case  III  -  2 
,  D.  F.  -  1  in  all  Cases. 

;  X  2 

1 

5^1 

1 

u 
u 

3  W  2 

u 

m 
« 
u 

^% 

J3  1-1 
m  H  3 

1 

• 
a) 

M 

a  a 

O  T3 

U  IS 

a  o 

3*4 

i-i 

e»   go 
U    01 

V  U 

>  o 
a)   4J 

1-1  ki 

o 

u 

o  a 

ft    O 

1 
Many 
Employees 



1.38 
1.84 

4'.  59* 
5.08* 

8.72** 
9.29* 

25.62** 
27.21* 

4.18* 
4.65* 

5.85* 
6.38 

2 
Wide 
Selection 

3.08 
3.79 

3.19 
3.71 

12.64** 
15.16** 

18.12** 
16.32** 

5.00* 
5.81* 

3 
Shortest 
Tine 

- 



12.21** 
13.05** 

14.07** 
14.99** 

4.19* 
4.74* 

29.69** 
31.20** 

4 
Uncongested 
Roads 

18.34** 
19.69** 

27.56** 
26.92** 

27.34** 
28.42** 

5 
Several 
Stores 



12.98** 
14.26** 

14.27** 
15.22** 

6 
Quick 
Parking 



11.40** 
12.16** 

7 

Close  to                                                        

Home 

Significance  Level 
*■  .01  <  Significance 

<  .01 
Level  <  .05 

Layers  from  top 

1.  Pearsonian  -^ 

2.  L-R-x^ 

► 
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Models  With  Main  Effects  and  1,1  Interactions 

For  Case  I  in  Table  6.4a  combinations  of  "Many  Employees"  with 
"Shorter  Time"  and  "Wide  Selection"  with  "Shorter  Time"  give  reason- 
ably good  fits.  Pairings  that  come  close  to  a  fair  reproducibility 
of  observed  attitudinal  orientation  patterns  are  "Many  Employees" 
with  "Uncongested  Roads"  "Many  Employees"  with  "Close  to  Home"  and 
"Wide  Selection"  with  "Close  to  Home."  For  Case  II  in  Table  6.5a  an 
improved  fit  for  "Many  Employees"  paired  with  "Shorter  Time"  is 
provided  and  the  fits  are  good  for  "Many  Employees"  with  "Uncongested 
Roads,"  "Many  Employees"  with  "Quick  Parking"  and  "Shorter  Time" 
with  "Several  Stores."  The  fitting  for  an  interaction  indicates  that 
the  model  allows  for  a  systematic  change  in  the  relationship  between 
two  factors  at  the  combination  of  factor  levels  corresponding  to  the 
interaction.  A  1,1  interaction  between  "Many  Employees"  and  "Shortest 
Time"  indicates  that  those  to  whom  the  presence  of  many  employees  is 
not  important  do  not  also  place  a  high  value  on  getting  to  their 
shopping  destination  as  early  as  possible.  A  similar  interpretation 
can  be  given  to  other   models  that  fitted  this  interaction  well  as 
pointed  out  above.  The  observed  and  predicted  frequency  distribu- 
tions of  attitudinal  orientation  patterns  for  the  models  discussed 
in  this  paragraph  are  shown  in  Tables  6,12a  to  6.12e  and  Tables  6.13a 
to  6,13d.  The  X^  reductions  due  to  the  1,1  interactions  are  given 
in  Tables  6.7  and  6.8, 

Models  With  Main  Effects  and  1,2  or  2,1  Interactions 

Tables  6.4a,  6,4b,  6.5a  and  6.5b  indicate  that  the  introduction 
of  1,2  or  2,1  interactions  in  addition  to  main  effects  does  not 
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Table  6.7 

X^  -  Values  in  2-Way 

Tables  *- 

Case  I,  Interactions 

' 

1 

II, 

u 

5 

u 
u 

4)  4) 

•rt  4) 

3  w  2 

« 

4» 

II 

4) 

W 

a 
«) 

60 

C 

o 

o 

■s 

0 

•  4 

n  m 

U    4) 
4)  M 

>  O 

4)  4J 

M 

•H  kl 
3  CO 

0 

8 

41 
00  tl 

o  a 

r-t     O 

"*7 

1 
Many 
Employees 

8.79** 

6.2^ 
2.27 

9.62 

23.22 

8.94** 
4.01 

13.00  :; 

10.56 

2 
Wide 
Selection. 



9.55** 
0.80 

13.24** 
12.45** 

2.20 
1.62 

21.82  ** 
13.58** 

3.79 
1.61 

3 

Shortest 
Tine 



9.87  ** 
8.83** 

9.37** 
13.28** 

5.48* 
2.71 

39.67  ** 
25.95** 

4 
Dncongesced 
Roads 

• 

29.40** 
9.51** 

22.40** 
8.01  ** 

21.43** 
9.43** 

5 
Several 
Stores 

14.11** 
9.00** 

20.26  * 
12.10** 

6 
Quick 
Parking 



15.85** 
3.72 

7 
Close  to 
Home 

Significance  level 
.01  <  Significance 

<  .01 

level  <  .05 

Layers  from  top 

1.  L-R-X*^  due  to  1,1  Interaction 

2.  L'R'X-^  due  to  2,2  Interaction 
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Table  6.8 


Values  in  2-Way  Tables  -  Case  II,  Interactions 


« 
o 

C    p. 


5^1     S 


§ 


o 

•O   1-1 

«l 

9    CA 


a 

01 

u 

U    01 

O     S 

to  H 


«l 

CO 

C  IB 

O  T) 

U  M 

B  O 

:3  OS 


Id  OS 

b  01 

0)  M 

>  o 


O-flu 


0) 

r-l     O 
O    X 


Many 
Employees 


1.84. 
9.85 


** 


5.07" 
3.91* 


9.29 
3.90* 


,** 


27.21' 
1.24 


** 


4.65* 
4.12* 


6.38** 
3.99* 


Wide 
Selection 


3.79 
9.29* 


3.71 
7.20' 


,** 


15.16** 
0.80 


16.32** 
10.96** 


5.80* 
6.59* 


Shortest 
Tine 


12.90 
8.71 


** 


15.00 
1.43 


4.73* 
2.53 


31.20** 
4.18* 


Dncongested 
Roads 


19.69** 
1.32 


26.92** 
10.00** 


28.42** 
2.67 


5 
Several 
Stores 


21.32" 
14.04* 


15.22" 
8.94 


** 


6 
Quick 
Parking 


12.16 
7.46 


** 


Close  to 
Home 


Significance  level  <  .01 
.01  <  Significance  level  <  .05 


Layers  from  top 

1.  L'R'X'^  due  to  1,1  Interaction 

2.  L.R.x^  due  to  2,2  Interaction 
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Tables  6.9a-6.9c  Attitudinal  Orientation  Pattern  Prediction  Tables^'^  - 

Case  III,  Main  Effects  Models 


Table  6.9a 
Many  Employees 
1         2 


g^ 

11 

19 

•H 

(14) 

(15) 

0) 

iH 
0) 

(U 
T3  eg 

116 

202 

•H 

13 

(113) 

(206) 

Table 

6.9b 

Wide 

Se 

lection 

1 

2 

13 

143 

2  '-^ 

(18) 

(138) 

4-1 

n 

Oi 

u 

M 
O  fM 

16 

176 

(11) 

(161) 

Row  Parameter  -2.3978 
(0.193) 
Col.  Parameter  -0.5540 
(0.111) 
Goodness-of-fit  x^^  1'84 


Row  Parameter  -0.2076 
(0.127) 
Col.  Parameter  -2.3978 
(0.193) 
Goodness-of-fit  x^=  3.79 


Table 

6.9c 

Wide 

Se; 

Lection 

' 

1 

2 

8 

92 

^ 

(13) 

(87) 

-o 

0) 

21 

227 

o 

(16) 

(232) 

P 

Cell  numbers  are  predicted 
frequencies  with  observed 
frequencies  in  parenthesis 

The  standard  deviation  of 
each  parameter  is  in 
parenthesis 

^Each  X  value  has  1  d.f. 


Row  Parameter  -0.9083 
(0.118) 
Col.  Parameter  -2.3978 
(0.193) 
Goodness-of-fit  x^-  3.71 
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Table  6.10  Attltudioal  Orientation.  Pattern  Prediction  Table  -  Case  I,  Main 
Effects  Models 


Many  Employees 
1 2 3 


17 

32 

41 

^ 

(25) 

(29) 

(36) 

1 

H           22 

41 

53 

U  CM 

S         (20) 

(47) 

(48) 

27 

SO 

66 

CO 

(20) 

(47) 

(76) 

Row  Faramf.cers:  -0.4630  -0.2179 

(0.139)   (0.125) 
Col.  Paraaeters:  -0.9008  -0.2630 

(0.147)   (0.120) 
Goodness-of-flt  x^=  9-59 

a,b 
Table  6.11  Attltudinal  Orientation  Pattern  Prediction  Table  -  Case  II,  Main 

Effects  Models 


Many  Employe 

es 

1 

2 

3 

57 

27 

72 

■-( 

(67) 

(25) 

(64) 

01 

n 

■H 

H 

u          18 

9 

22 

IS  CM 

0) 

tJ        (16) 

(13) 

(20) 

o 

-C 

M 

52 

25 

66 

PI 

(44) 

(23) 

(76) 

Row  Parameters:  0.0870  -1.071 
(0.115)  (0.165) 
Col.  Parameters;  -0.231  -0.9543 
(0.119)  (0.150) 
Goodness-of-flt  x  •  8.26 

xhe  cross-classification  tables  show  predicted  and  observed  (in  parenthesis) 
cell  frequencies. 

The  standard  deviations  of  parameter  estlicates  are  in  parenthesis. 
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Tables  6.12a  -  6.12d  Attitudlnal  Orientation  Pattern  Prediction  Tables**^ 

Effects  +1,1  Interaction  Models 


Main 


Table  6.12a 


Case  I 


Table  6.12b 


Many  Employees 
1 2 3 


Wide  Selection 
1 2 3 


J 


•  IS 

« 
*J 
u 

o 

M 


25 
(25) 


28 
(29) 


37 
(36) 


18 

A2 

55 

(20) 

(47) 

(48) 

22 

53 

68 

(20) 

(47) 

(76) 

r^  (9) 


13 
(16) 


68 
(65) 


1-t  — 

H 
u 
« 

2 
(3) 

18 
(20) 

95 
(92) 

U 

\i 

M 

3 
(2) 

22 
(16) 

118 
(125) 

Row  Parameters:  -0.3733 

-0.2179 

(0.139) 

(0.125) 

Col.  Parameters:  -0.8786 

-0.2630 

(0.148) 

(0.120) 

1,1  Interaction:  0.7401 

(0.291) 

Goodness-of-fit  x  •  3.38 

Table  6.12c 

Row  Parameters: 

0.0267 

-0.2179 

(0.212) 

(0.125) 

Col.  Parameters; 

;  -3.1791 

-1.6907 

(0.324) 

(0.151) 

1,1  Interaction! 

;  ,1.7267 
X  :4.00 

(0.571) 

Goodness-of-fit 

Table  6. 

,12d 

■0 

a 
i 


Many  Employees 
1 2 3 


22 
(22) 


■o     15 
u         (16) 

4)    

00 

a 

§«   28 
B'^     (27) 


14 
(12) 


37 
(48) 


72 
(63) 


19 
(21) 


48 
(36) 


93 
(103) 


% 
o 

P3 

O 

4J  CM 

4) 

« 
O 


Many  Employees 
1       2       3 


,   29 
(29) 

27 
(27) 

35 
(35) 

,   1* 
(14) 

38 
(49) 

49 
(38) 

.   22 
(22) 

58 
(47) 

76 
(87) 

Row  Parameters: 

-1.1559 

-0.6575 

(0.157) 

(0.123) 

Col.  Parameters: 

-0.7384 

-0.2630 

(0.149) 

(0.120) 

1,1  Interaction: 

1.3548 

(0.320) 

Goodness-of-fit  ' 

y^:  9.30 

Row  Parameters: 

Col.  Parameters: 

1,1  Interaction: 
Goodness-of-fit  X 


-0.4202 
(0.192) 
-0.8928 
(0.150) 
1.0548 
2-  9.71 


-0.4347 
(0.127) 
-0.2630 
(0.119) 
(0.287) 


*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 
frequencies. 
The  standard  deviations  of  parameter  estimates  are  In  parenthesis. 
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Tables  6.12e  -  6.13b  Attltudlnal  Orientation  Pattern  Prediction  Tables 

Effects  +1,1  Interaction  Models 


a,b 


-  Main 


Case  I 

Table  6.12e 

Wide  Selection 
J^ 2  3 


7 

13 

71 

■-I 

(7) 

(18) 

(66) 

SB 

O 
U  CM 

3 

15 

83 

•1 

(4) 

(19) 

(78) 

s 

o 

m 

A 

24 

128 

(3) 

(15) 

(138) 

Row  Parameters:  -0.2279  -0.4347 

(0.203)  (0.127) 

Col.  Parameters:  -3.0428  -1.6907 

(0.285)  (0.151) 

1,1  Interaction:  1.0906  (0.549) 

Goodness-of-fit  x^-'  8.41 


Table  6.13a 

Many  Employees 
1        2       3^ 


H 

67 

25 

64 

1 

(67) 

(25) 

(64) 

H 

15 

9 

24 

41 
U 

(16) 

(13) 

(20) 

U 

o 

^ 

M 

fO 

45 

27 

71 

(44) 

(23) 

(76) 

Case  II 

Table  6.13b 

Many  Employees 
12                  3 

(0  iH 

•s 
5    . 

49 
(49) 

14                37 
(16)            (35) 

« 
*J  CM 

a 

17 
(18) 

10                27 
(15)            (22) 

o 

-    1 

CO 

61 
(60) 

37                96 
(30)          (103) 

Row  Parameters:   0.0687  -1.0709 

(0.117)  (0.165) 

Col.  Parameters:  -0.2973  -0.9643 

(0.126)  (0.150) 

1,1  Interaction:   0.5045  (0.224) 

Goodness-of-fit  x^:  3.19 


Row  Parameters:   -0.7069  -1.2554 

(0.127)  (0.152) 

Col.  Parameters:  -0.2097  -0.9643 
(0.120)   (0.150) 

1,1  Interaction:   0.7391   (0.242) 

Goodness-of-fit  x^'-    5.04 


*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 
frequencies. 
The  standard  deviations  of  parameter  estimates  are  in  parenthesis. 


Tables  6.13c  -  6.13d  Attitudlnal  Orientation  Pattern  Prediction  Tables''*"  -  Main 
Effects  +1,1  Interaction  Models 


Case  II 
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Table  6.13c 


Table  6.13d 


Many  Employees 
12                  3 

5 

33 
(33) 

10                26 
(8)            (28) 

u 

15 
(17) 

8                22 
(13)            (15) 

3 

79 
(77) 

43              112 
(40)          (117) 

Shortest  Time 
A 2 3 


u 

> 
« 


95 
(95) 

20 
(21) 

57 
(56) 

20 
(25) 

10 
(11) 

28 
(21) 

41 
(36) 

20 
(17) 

58 
(66) 

Row  Parameters:  -1.2643  -1.6487 

(0.141)  (0.162) 

Col.  Parameters:  -0.1574  -0.9643 

(0.124)  (0.150) 

1,1  Interaction:   0.5901  (0.272) 

Goodnes9-of-fit  x^=  5.79 


Row  Parameters: 

0.2716 

-0.7361 

(0.124) 

(0.161) 

Col.  Parameters: 

-0.0580 

-1.0710 

(0.124) 

(0.165) 

1,1  Interaction: 

0.8441 

(0.220) 

Goodness-of-fit 

X2:   5.62 

Tables  6.14a  -  6.14b  Attitudinal  Orientation  Pattern  Prediction  Tables^' ^  -  Main 
Effects  +   2,2  Interaction  Models 


Case  II 


Table  6.14a 


Table  6.14b 


Many 

Employees 

1 

2 

3 

58 

25 

73 

1 

(67) 

(25) 

(64) 

H 

u 

16 

13 

20 

« 

(16) 

(13) 

(20) 

t.; 

Xi 

53 

23 

67 

(44) 

(23) 

(76) 

Many  Employees 
1 2 3 


Row  Parameters:   0.0870  -0.9924 

(0.116)  (0.169) 

Col.  Parameters:  -0.2310  -0.8580 

(0.119)  (0.135) 

2,2  Interaction:   0.6357  (0.358) 

Goodness-of-fit  x^-  5.35 


26 

11 

32 

s 

(33) 

(8) 

(28) 

Jd 

« 

14 

13 

18 

jrf 

(17) 

(13) 

(15) 

O 

^. 

87 

37 

110   . 

(77) 

(40) 

(117) 

Row  Parameters:   -1.2212  -1.5607 

(0.137)  (0.165) 

Col.  Parameters:  -0.2310  -0.8293 

(0.119)  (0.140) 

2,2  Interaction:   0.7693  (0.364) 

Goodness-of-fit  x^:  6-32 


*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 
frequencies. 
The  standard  deviations  of  parameter  estimates  are  in  parenthesis. 
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bring  any  improvement  in  attitudinal  pattern  prediction  above  that 
provided  by  the  Main  Effects  models. 

Models  With  Main  Effects  and  2,2  Interactions 

With  the  addition  of  a  2,2  interaction  the  significant  reductions 
in  X^  values  due  to  this  interaction  were  not  enough  to  appreciably 
improve  the  accuracy  of  reproducibility  of  observed  attitudinal 
orientation  patterns  between  pairs  of  factors  for  Case  I,  shown  in 
Table  6.4b.   For  Case  II  in  Table  6,5b  only  the  pairings  of  "Many 
Employees"  with  "Shortest  Time"  and  "Many  Employees"  with  "Quick 
Parking"  yield  reasonable  models. 

The  significant  X  reductions  due  to  the  2,2  interactions  are 
indicated  in  Tables  6.7  and  6.8.   Observed  and  predicted  frequency 
distributions  for  models  described  above  are  shown  in  Tables  6.14a 
and  6.14b. 

Models  With  Main  Effects,  1,1  and  2,2  Interactions 

The  models  fitted  using  main  effects  as  well  as  1,1  and  2,2 
interactions  give  good  predictions  when  the  factor  "Many  Employees" 
is  paired  with  "Shortest  Time,"  "Uncongested  Roads"  and  "Close  to 
Home"  as  shown  in  Table  6.4b  for  Case  I.   The  same  applies  in  this 
table  to  the  pairings  of  "Wide  Selection"  with  "Shorter  Time," 
"Wide  Selection"  with  "Quick  Parking"  and  "Shorter  Time"  with 
"Several  Stores."  Case  II  essentially  gives  similar  results  in 
Table  6.5b  and  in  addition  shows  good  predictions  for  "Many  Employees" 
paired  with  "Quick  Parking,"  "Several  Stores"  with  "Quick  Parking" 
and  "Quick  Parking"  with  "Close  to  Home."   The  predicted 
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and  observed  frequency  distributions  for  the  models  mentioned  are 
shown  in  Tables  6.15a  to  6,15f  and  Tables  6.16a  to  6.16J. 

In  the  few  situations  in  which  Case  I  differs  from  Case  II» such 
a  difference  can  be  attributed  to  the  way  the  categories  were  defined. 
Case  II  appears  preferable  since  it  provides  good  predictions  for 
more  pairings  of  factors  than  Case  I.   If  the  sample  size  was  large 
enough, or  the  categories  wereno  more  than  five,  it  would  not  be  necess- 
ary to  have  to  try  different  combinations  of  extreme  categories  on 
the  successive  category  rating  scale  to  obtain  a  useful  classification 
table.  The  main  effect  with  1,1  and  2,2  interaction  model  is  the  one 
recommended  as  best  fitting  the  survey  data  for  Cases  I  and  II. 
This  is  on  the  basis  of  the  good  predictions  that  this  model  achieves, 
demonstrated  by  the  associated  low  goodness-of-fit  X^  values. 

Deductions  From  the  Models 


The  important  deduction  that  can  be  made  in  evaluating  the  models 
is  that  the  following  pairs  of  factors  may  be  considered  in  modelling 
shopping  destination  choice  for  the  sample  investigated  on  the  basis 
of  Case  II  models. 

1.  "Many  Employees"  and  "Shortest  Time" 

2.  "Many  Employees"  and  "Uncongested  Roads" 

3.  "Many  Employees"  and  "Quick  Parking" 

4.  "Many  Employees"  and  Close  to  Home" 

5.  "Wide  Selection"  and  "Shortest  Time" 

6.  "Wide  Selection"  and  "Quick  Parking" 

7.  "Shorter  Time"  and  "Several  Stores" 
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Tables  6.15a  -  6.15d 


Attltudlnal  Orientation  Pattern  Prediction  Tables*'**  -  Main 
Effects  +1,1  Interaction  +2,2  Interaction  Models 


Case  I 


Table  6.15a 


Table  6.15b 


Many  Employees 
1 2 


I 


o 

01 


25 
(25) 

27 
(29) 

38 
(36) 

17 
(20) 

47 
(47) 

51 
(48) 

23 
(20) 

49 
(47) 

71 
(76) 

Many  Employees 
A 2       3 


m 


a 
o 
o 

5  tn 


22 
(22) 

13 
(12) 

20 
(21) 

12 
(16) 

48 
(48) 

40 
(36) 

31 
(27) 

62 
(63) 

100 
(103) 

Bow  Parameters:  -0.3826 

-0.2297 

(0.138) 

(0.126) 

Col.  Parameters:  -0.8839 

-0.2677 

(0.148) 

(0.120) 

1,1  Interaction:  0.6871 

(0.289) 

2,2  Interaction:   0.2737 
Goodness-of-fit  x  •  2.07 

(0.238) 

Table  6.15c 

Row  Parameters:  -1.1734  -0.7014 


(0.157) 

(0.126) 

Col.  Parameters: 

-0.2519 

1.2574 

(0.152) 

(0.121) 

1,1  Interaction: 

1.2574 

(0.323) 

2,2  Interaction: 

0.6603 

(0.345) 

Goodness-of-fit 

X2:  2.06 

Table  6. 

15d 

Many  Employees 
JL 2 3 


Q 
O 


29 
(29) 

24 
(27) 

38 
(35) 

12 
(14) 

49 
(49) 

40 
(38) 

24 
(22) 

50 
(47) 

82 
(87) 

..3 


■9  N 

tl 

U 

o 
u 


Wide  Selection 
1 2 3_ 


9 
(9) 

12 
(16) 

69 
(65) 

2 
(3) 

20 
(20) 

93 
(92) 

3 
(2) 

20 
(16) 

120 
(125) 

Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


-0.4418 

(0.136) 

-0.9049 

(0.150) 

0.9386 

0.6821 

(^:  2.00 


-0.4816 
(0.131) 
-0.2550 
(0.121) 
(0.290) 
(0.245) 


Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


0.0214 
(0.212) 
-3.1807. 
(0.324) 
1.7096 

0.2421 
<2:  3.41 


-0.1753 
(0.136) 
-1.6963 
(0.151) 
(0.181) 
(0.311) 


The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 
f  requenorles . 
°The  standard  deviations  of  parameter  estimates  are  in  parenthesis. 


150 


Tables  6.15e,£  and  6.16a,b 


a  b 

Attltudlnal  Orientation  Pattern  Prediction  Tables  ' 

Main  Effects  +  1,1  Interaction  +  2,2  Interaction  Models 


Case  I 


Table  6.15e 


Table  6.15f 


Wide 

Selection 

1 

2 

3 

H 

9 

4 

31 

M 

(9) 

(5) 

(30) 

a 

M 

u 

<0 

1 

21 

49 

M 

(2) 

(21) 

(47) 

O 

•H    _ 

4 
(3) 


27 
(26) 


203 
(205) 


Shortest  Time 
1 2 3_ 


a 
« 
u 

o 


u 

4) 

> 


40 

26 

43 

(40) 

(30) 

(39) 

22 

55 

43 

(27) 

(55) 

(38) 

28 
(23) 


34 
(30) 


56 
(66) 


Row  Parameters:  -0.9883  -1.0435 

(0.222)  (0.143) 

Col.   Parameters:-3.0222  -1.6233 

(0.326)  (0.154) 

1.1  Interaction:  2.6827  (0.585) 

2.2  Interaction:  1.1936  (0.323) 
Goodness-of-fit  Y^:  1.44 


Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


-0.0671 

(0.134) 

-0.4863 

(0.136) 

0.6184 

0.7462 

'2:   5.96 


-0.0187 
(0.131) 
-0.2588 
(0.128) 
(0.260) 
(0.242) 


Table  6.16a 


Case  II 


Table  6.16b 


Many  Employees 
1 2 3_ 


Many  Employees 

2  3 


I 


eg 
4) 
U 
U 
O 
A 
M 


67 
(67) 

23 
(25) 

66 
(64) 

14 
(16) 

13 
(13) 

22 
(20) 

46 
(44) 

25 
(23) 

72 
(76) 

a  iH 

<d 
o 
OS 

« 

U  CM 

00 
0) 

eo 

a 
o 
o 
a  « 

9 


49 
(49) 

13 
(16) 

38 
(35) 

16 
(18) 

15 
(15) 

24 
(22) 

62 
(60) 

33 
(30) 

98 
(103) 

Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


0.0680 
(0.116) 

-0.2942 

(0.124) 

0.4631 

0.5315 

^•.   1.16 


-1.0097 
(0.169) 
-0.8791 
(0.159) 
(0.226) 
(0.363) 


Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


-0.7075 

(0.127) 

-0.2137 

(0.120) 

0.6987 

0.5948 

t^:   1.11 


-1.1875 
(0.156) 
-0.8825 
(0.156) 
(0.244) 
(0.344) 


*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 

frequencies 
"The  standard  deviations  of  parameter  estimates  are  in  parenthesis 
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Tables  6.16c  -  6.16f  Attltudlaal  Orientation  Pattern  Prediction  Tables**'' 

Effects  +  1,1  Interaction  +  2,2  Interaction  Models 


Main 


Case  II 


Table  6.16c 


Table  6.16d 


Many  Employees 
1       2       3 

5 

33 
(33) 

9      27 
(8)     (28) 

•2 

P*   CM 

13 
(17) 

13      19 
(13)     (15) 

81 
(77) 

39      114 
(40)    (117) 

Many  Employees 
J^ 2 3 


Row  Parameters: 
Col.  Parameters 

1.1  Interaction 

2.2  Interaction 
Goodness-of-fit 


g 

03 

O 

« 

■ 
o 

1-4 

u 


-1.2640 
(0.141) 
:  -0.1635 
(0.124) 
:  0.5536 
;  0.7127 
X2:    2.25 


-1.5704 

(0.165) 

-0.8417 

(0.160) 

(0.272 

(0.364) 


63 

(63) 

20 
(26) 

59 
(53) 

14 
(16) 

14 
(14) 

22 
(20) 

50 
(48) 

27 
(21) 

79 
(87) 

Table  6.16e 


Row  Parameters: 

-0.1190 

-1.0692 

(0.117) 

(0.165) 

Col.  Parameters: 

:  -0.2824 

-0.8692 

(0.122) 

(0.157) 

1,1  Interaction: 

0.5238 

(0.228) 

2,2  Interaction: 

0.6229 

(0.355) 

Goodness-of-fit 

X^:  4.97 

Table  6. 

16f 

Wide  Selection 
1       2 


iH 

18 

13 

125 

1 

(18) 

(16) 

(122) 

H 

U 
tt  «N 

«) 
U 

u 
o 

2 
(2) 

12 
(12) 

35 
(35) 

M 

9 

12 

122 

tn 

(9) 

(9) 

(125) 

Wide 

1 

Selection 
2       3 

a 

•s 

o 

13 
(13) 

8      79 
(14)     (73) 

«  CM 

■ 

3 
(2) 

12      40 
(12)     (41) 

1 

o 

13 
(14) 

17      163 
(11)    (168) 

Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


0.2648 
(0.153) 

-2.4177 
(0.224) 
0.7156 
1.2341 

<2:  2.07 


-0.8503 
(0.175) 
-1.8809 
(0.179) 
(0.399) 
(0.398) 


Row  Parameters: 

-0.4816 

-1.0534 

(0.154) 

(0.165) 

Col.  Parameters: 

-2.2951 

-1.9139 

(0.200) 

(0.178) 

1,1  Interaction: 

0.7386 

(0.394) 

2,2  Interaction: 

1.0702 

(0.388) 

Goodness-of-fit 

X^:  6.97 

*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 

frequencies. 
''The  standard  deviations  of  parameter  estimates  are  in  parenthesis. 
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Tables  6.16g  -  6.16J  Attitudlnal  Orientation  Pattern  Prediction  Tables*'**  -  Main 

Effects  +1,1  Interaction  +2,2  Interaction  Models 


Table  6.16g 


Table  6.16h 


Wide 

1 

Selection 
2       3 

l-l 

a 

15 
(15) 

5      49 
(8)     (46) 

4{ 

2 
(2) 

12       31 
(12)     (31) 

3 

12 
(12) 

20      202 
(17)    (205) 

Shortest  Time 
1       2 3 


« 
u 

o 


« 
CO 


95 
(95) 

19 
(21) 

58 
(56) 

19 
(25) 

11 
(11) 

27 
(21) 

42 
(36) 

19 
(17) 

58 
(66) 

Row  Parameters: 
Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodaess-of-fit 


-0.8713 
(0.161) 
-1.8551 
(0.208) 
1.6256 
1.3528 


.2. 


2.42 


-1.4180 
(0.172) 
1.6256 
(0.179) 
(0.407) 
(0.396) 


Table  6.161 


Row  Parameters: 

0.2719 

-0.7002 

(0.124) 

(0.171) 

Col.  Parameters: 

:  -0.0566 

-1.0418 

(0.124) 

(0.171) 

1,1  Interaction: 

0.824A 

(0.223) 

2,2  Interaction: 

:   0.2283 

(0.382) 

Goodness-of-fit 

X^:  5.28 

Table  6. 

16J 

Several  Stores 
12       3 

•H 

3 

48 
(48) 

6      15 
(9)     (12) 

m 

■3 

4 
(3) 

10       4 
(10)      (5) 

93 
(94) 

41      100 
(38)    (102) 

Quick  Parking 
12       3 


f-> 

41 

14 

87 

i 

(41) 

(19) 

(82) 

M 

O 
U   CM 

6 

13 

31 

•I 

(6) 

(13) 

(31) 

o 

El 

22 

18 

116 

(22) 

(13) 

(122) 

Row  Parameters:  -1.4898  -2.5920 


Col.  Parameters: 

1.1  Interaction: 

2.2  Interaction: 
Goodness-of-fit 


(0.167) 
0.3428 

(0.133) 

1.2403 

1.7752 

(2;  2.44 


(0.268) 
-0.3018 
(0.201) 
(0.292) 
(0.500) 


Row  Parameters: 

0.0133 

-0. 

.9893 

(0.123) 

(0. 

.169) 

Col.  Parameters: 

-1.3561 

-1. 

.5219 

(0.170) 

(0. 

.167) 

1,1  Interaction: 

0.9015 

(0. 

.275) 

2,2  Interaction: 

0.9758 

(0, 

.373) 

Goodness-of-fit 

)(2:  4.02 

*The  cross-classification  tables  show  predicted  and  observed  (in  parenthesis)  cell 

frequencies. 
"The  standard  deviations  of  parameter  estimates  are  in  parenthesis. 
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8.  "Several  Stores"  and  "Quick  Parking" 

9.  "Quick  Parking"  and  "Close  to  Home" 

It  is  noticed  that  some  of  the  pairings  of  factors  are  essenti- 
ally indicating  the  same  thing.   An  example  is  that  with  pairs  1  and 
5  since  where  there  is  a  wide  selection  of  goods  there  are  usually- 
more  employees  and  congested  roads  usually  require  more  travel  time. 
The  results  cannot  be  taken  as  conclusive  but  it  must  be  noted  that 
generally  no  three  of  the  factors  can  be  considered  jointly  in  a 
model  since  two  of  them  aire  likely  to  have  a  significant  interaction 
that  can  distort  the  predictive  power  of  a  model  using  all  three 
factors.  This  is  a  welcome  finding  especially  if  it  can  be  verified 
in  further  research  since  it  is  «likely  to  indicate  that  model  devel- 
opment should  be  with  just  two  or  three  attitudinal  factors  that  can 
be  associated  with  some  measurable  variables.  An  example  of  a 
good  pair  would  be  "Wide  Selection"  and  "Shortest  Time," 

Summary 

It   was    established  that  therewas  significant  correlation 
between  many  pairs  of  the  destination  choice  successive  category 
factors  available-  for  this  research.  The  frequency  distributions  of 
the  ratings  for  the  factors  did  not  display  normality  necessary  for 
classical  successive  category  data  analysis  approaches.  This  necess- 
itated the  use  of  contingency  tables  with  a  smaller  number  of  cate- 
gories in  the  ways  of  classification  because  of  the  small  sample 
size,  for  reproducing  observed  attitudinal  orientation  patterns. 
Such  predictability  is  argued  to  be  necessary  in  short  range 
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forecasting  or  the  expansion  of  attitudinal  orientation  patterns  for 
the  total  population  using  data  for  a  sample. 

It  was  found  that  essentially  only  two  of  the  factors  need  be 
present  in  a  single  modelling  framework  since  most  of  them  have  a 
similar  effect.  A  combination  of  three  factors  cannot  occur  without 
significant  interaction  between  two  of  them  distorting  the  modelling 
results.  With  the  inclusion  of  certain  interaction  effects  in  2-way 
classifications,  good  predictability  of  the  attitudinal  orientation 
patterns  was  possible  for  many  pairs  of  factors.  The  essential 
factors  are  related  to  measures  of  the  wide  range  of  goods  or  services 
at  a  shopping  area  or  the  convenience  in  reaching  a  destination  in 
terms  of  travel  time  and  distance. 
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CHAPTER  7 
ESTIMATION  WITH  PROPOSED  FUNCTIONAL 
DESTINATION  CHOICE  MODELLING  FRAMEWORK 

The  objective  of  this  chapter  is  to  illustrate  the  multinomial 
response  relation  framework  for  modelling  the  choice  of  functional 
destinations.  This  modelling  framework  has  a  logical  provision  for 
aggregate  forecasts  and  hence  can  be  used  in  an  operational  set  of 
models.  It  is  clear  that  the  sample  size  used  in  this  research  was 
limited  by  its  smallness.  Also  the  data  were  not  collected  with  the 
analytical  procedures  discussed  in  this  research  in  mind.  The  examples 
of  the  application  of  the  multinomial  response  relation  approach  are, 
therefore,  given  the  best  way  the  data  allowed.  The  fundamental 
theoretical  arguments  for  adopting  the  approach  illustrated  in  this 
chapter  were  presented  in  Chapter  5.   It  is  strongly  maintained  by  the 
writer  that  if  encouraging  results  occur  they  should  serve  as  a  basis 
for  the  collection,  in  the  future,  of  a  more  appropriate  data  set  to 
better  appreciate  the  usefulness  of  the  multinomial  response  relation 
approach. 

Not  more  than  three  market  segments  were  considered  at  a  time  in 
the  empirical  analyses  discussed  in  this  chapter  mainly  because  of  the 
small  sample  size  of  households.   It  would  have  been  desirable  not 
only  to  have  considered  2-way  classification  categories  for  market 
segmentation  with  respect  to  the  attitudinal  variables  but  to 
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combine  the  attitudinal  factors  and  the  appro achability  measures  for 
segmentation  for  the  same  model.   Though  the  small  sample  size  did 
not  permit  this,  the  effect  of  the  combination  of  attitudinal  and 
approachability  factors  was  examined  by  testing  their  independence 
and  the  reproducibility  of  their  classifications. 

The  functional  classification  of  destinations  was  also  limited  to 
2-way  classifications  because  of  the  small  sample  size  of  the  avail- 
able data.   2-way  classifications  involving  Retail  Floor  Area  with 
each  of  Line  Haul  Travel  Time  and  Airline  Distance  were  used  in  model 
development  to  define  functional  destination  alternatives,  as  illus- 
trated in  Figures  7.1  and  7.2.   Based  on  the  fact  that  not  more  than 
three  market  segments  were  involved  in  any  single  application  of  the 
multinomial  response  relation  approach,  a  separate  contingency  table 
for  each  market  segment  was  analyzed.   A  comparison  of  the  parameters 
estimated  for  models  predicting  the  contingency  table  cell  frequencies 
established  the  effectiveness  or  need  for  a  given  variable  or  factor 
to  be  used  as  a  basis  for  such  a  one-way  market  segmentation.  Some 
multivariate  statistical  tests  were  employed  for  testing  the  equality 
of  model  parameter  vectors  and  their  covaiance  matrices. 

The  data  that  were  used  for  the  empirical  analysis  in  this  chapter 
were  derived  from  data  originally  collected  by  Kannel  [10]  in  Indiana- 
polis in  1971,  supplemented  with  further  data  collection  and  prepara- 
tion by  the  writer.      The  data  base  has  been  described  in  Chapters  4 
and  6.   A  discussion  of  some  assumptions  implied  in  functional 
destination  choice  modelling,  with  the  aid  of  some  empirical  analysis 
presented  in  Appendix  B,  is  also  presented. 
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OJ 


TCLS 
2 


KFL 

1  Retail    Floor   Area  ^  300,000 sq.  ft. 

2  Retail    Floor  Area  <  300,000sq.  ft. 

TCLS 

1  Line  Haul   Travel  Time^lOmins. 

2  Smins.^  Line  Haul  Travel  Time  "^  lOmins. 

3  Line  Haul  Travel  Time'*^  5  mins. 


FIGURE    71  FUNCTIONAL    SHOPPING  DESTINATION  CATEGORIES 
DEFINED    BY   RETAIL   FLOOR    AREA    AND 
LINE    HAUL    TRAVEL    TIME 
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CM 


DCLS 
2 


KFL 

1  Retail    Floor   Area  ^  300,000sq.  ft. 

2  Retail    Floor  Area  <  300,000sq.  ft. 

DCLS 

1  Airline   Distance  ^8 units 

2  4units^Airline   Distance  "^^  8 units 

3  Airline   Distance*-^  4units 

Each    unit  of    Distance  is  equivalent  to  0.833  miles 


FIGURE    7-2  FUNCTIONAL    SHOPPING  DESTINATION  CATEGORIES 
DEFINED    BY   RETAIL   FLOOR    AREA    AND 
AIRLINE    DISTANCE 
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The  theory  and  application  of  the  multinomial  response  relation 
(MRR)  approach  have  been  presented  in  Chapter  5.  Detailed  example 
calculations  are  given  in  Appendix  A.  The  bases  used  for  empirical 
investigation  in  this  chapter  are  derived  in  the  manner  discussed  for 
the  basis  construction  example  in  Chapter  5  since  all  the  applications 
in  this  chapter  are  for  2x3  crossed  designs. 

Prediction  of  Functional  Shopping  Destinations  Defined  With  Respect  to 
Retail  Floor  Area  and  Line  Haul  Travel  Time  by  Attitudinal  Factor 
Market  Segments 

Table  7.1  summarizes  the  result  of  an  experimental  model  to  repro- 
duce the  cell  frequencies  in  the  contingency  table  with  respect  to 
Retail  Floor  Area  and  Line  Haul  Travel  Time  that  defines  functional 
shopping  destinations.  The  predictive  accuracy  of  the  model  summar- 
ized in  Table  7.1  is  very  good  and  reflects  on  the  reliability  of 
destination  choice  models  in  terms  of  functional  destinations.  The 
question  that  is  addressed  in  this  section  is  whether  the  attitudinal 
orientation  of  groups  of  people  causes  them  to  select  functional 
shopping  destinations  differently.  With  respect  to  each  destination 
choice  attitudinal  factor,  an  individual  belongs  to  market  segment  1 
if  the  rating  for  that  factor  on  the  successive  category  scale  is  less 
than  or  equal  to  4  and  to  segment  2  if  the  rating  is  greater  than  4. 
Less  than  10  percent  of  the  sample  fell  into  market  segment  1  for  the 
attitudinal  factor  relating  to  "Shortest  Travel  Time"  so.it  was  not 
possible  to  develop  a  predictive  model  for  segment  1  for  this  factor 
for  which  reason  it  was  excluded  from  the  analysis. 
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Table  7.1  Prediction  of  Functional  Destination  Choice  Frequencies 

for  the  Total  Sample  With  Respect  to  Retail  Floor  Area  and 
Line  Haul  Travel  Time  ^»b 


1 

TCLS 
2 

3 

58 

68 

76 

(61) 

(68) 

(73) 

9 

10 

12 

(6) 

(10) 

(15) 

Parameters 

Row:  1.874 

Col.  (l):-0.273 

Col.  (2):-0.121 

7 
Goodness-of-fit  x 


Std. 
Deviation 

(0.193) 
(0.162) 
(0.155) 
2.23  (2  d.f) 


KFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories 
defined  in  Figure  7.1 

Numbers  in  cells  are  predicted  frequencies  with  observed  frequencies 
in  parenthesis 


Table  7.2  Prediction  of  Functional  Destination  Choice  Frequencies  for 
the  Total  Sample  With  Respect  to  Retail  Floor  Area  and 
Airline  Distance  ^'^ 


1 

DCLS 
2 

3 

Row: 
Col.  (1): 
Col.  (2): 
Goodness- 

Parameters 

1.874 
-1.304 
-0.626 
of-fit  x^:  1 

Std. 
Deviation 

^ 

30 
(31) 

60 
(62) 

112 
(109) 

(0.193) 
(0.190) 
(0.149) 
27   (  2  d.f.) 

^  • 

5 
(4) 

9 

(7) 

17 

(20) 

KFL  and  DCLS  are  retail  floor  area  and  airline  distance  categories 
defined  in  Figure  7.2 

Numbers  in  cells  are  predicted  frequencies  with  observed  frequencies 
in  parenthesis 
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2 
The  goodness-of-fit  X  values  for  models  fitted  with  main  effects 

with  respect  to  attitudinal  market  segments  in  predicting  functional 
destinations  are  shown  in  Table  7.3.   In  three  cases  the  X  value  was 
significant,  at  most,  at  the  5-percent  level  of  significance.  After 
fitting  the  1,1  interaction  effect  in  addition  to  main  effects,  as 
shown  in  Table  7.3,  the  goodness-of-fit  X  value  is  reduced  in  all 
cases.  The  reduction  in  X  value  accounted  for  by  the  1,1  interaction 
is  significant  at  the  5  percent  level  in  all  cases  where  there  was  a 
need  to  fit  this  interaction  to  reduce  the  goodness-of-fit  X  value. 
That  the  contribution  of  the  1,2  interaction  is  small  is  seen  from  the 
reduction  in  goodness-of-fit  it  brings  about  as  shown  in  Table  7.3. 
The  observed  and  predicted  cell  frequencies  and  model  parameter  estim- 
ates are  presented  in  Tables  7.4a  to  7.9b  for  the  main  effects  models 
and  Tables  7.10a  to  7.15b  for  the  main  effects  plus  1,1  interaction 
models.   Considering  the  fact  that  the  relatively  higher  percentage 
errors  of  prediction  occur  with  the  cells  of  low  observed  frequencies, 
many  of  the  main  effects  models  perform  quite  well.  The  models  fitted 
with  main  effects  and  1,1  interaction  give  almost  perfect  prediction. 
This  should  only  serve  as  a  strong  indication  of  the  usefulness  in 

fitting  for  certain  key  interactions  where  the  number  of  degrees  of 

2 
freedom  for  the  goodness-of-fit  X  is  large  enough.   In  the  present 

context,  since  the  1,2  interaction  effects  are  negligible,  fitting 

for  the  1,1  interaction  may  amount  to  a  self-fulfilling  prophecy. 

Interpretation  of  Modelling 

In  Tables 7. 4a  to  7.9b  the  following  interpretation  is  given  to 
the  parameter  estimated.   The  row  parameter  is  denoted  by  F  and  the 
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Table  7.3  Likelihood  Ratio  Goodness-of-Fit  x^  Values  for  Fitting  KFL  x 
TCLS  ^  Contingency  Tables 


Effects 

1  Fitted  in  1 

Model 

Destination  Choice 

Main  Effects^ 

Main 

Effects^ 

Main 

Effects-^ 

Attitudinal  Factor 

+1,1 

Interaction 

+1,2 

Interaction 

Used  for  Market 

Market 

Segment 

Market  Segment 

Market  Segment 

Segmentation 

1 

2 

1 

2 

1 

2 

Many  Employees 

0.62 

9.19* 

0.48 

0.53 

0.02 

8.70** 

Wide  Selection 

5.35 

2.85 

0.28 

0.15 

5.30 

1.49 

Uncongested  Roads 

6.24* 

2.96 

0.42 

0.32 

6.06* 

1.47 

Several  Stores 

1.97 

6.66** 

0.05 

0.02 

1.85 

5.67 

Quick  Parking 

5.84 

3.78 

0.23 

0.83 

3.75 

3.77 

Close  to  Home 

3.67 

5.65 

0.02 

0.34 

2.59 

5.54 

KFL  and  TCLS  refer  to  Retail  Floor  Area  and  Line  Haul  Travel  Time 
Categories  defined  in  Figure  7.1. 

Each  Likelihood  Ratio  Goodness  of  Fit  x^  Value  has  2d.f. 

^Each  Likelihood  Ratio  Goodness  of  Fit  x^  Value  has  Id.f. 

*Signif leant  at  the  0.05  level. 

Significant  at  the  a 01  level. 


Tables  7.4a  -  7.Sb  Functional  Destination  Choice  Predictions  by  Harket  Segments*'^ 


Table  7.4a  Many  Employees 
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g 


Segment  1 
TCLS 
1 2 


Parameters 


Std. 

Deviation 


7 

12 

18 

Row: 

0.6152 

(0 

278) 

(7) 

(11) 

(19) 

Col. 

(1): 

-1.030 

(0 

367) 

Col.  (2):  -0.388 
Goodness-of-flt  x^' 

(0 
0.62 

296) 

3 

7 

10 

(3) 

(8) 

(9) 

Table  7.4b  Many  Employees 

Segment  2 
TCLS 
1 2 3_ 


g 


39 

44 

42 

(46) 

(42) 

(37) 

13 

15 

15 

(6) 

(17) 

(20) 

Row:  1.067  (0-176) 
Col.  (1):  -0.092  (0.191) 
Col.  (2):  0.034       (0.185) 

Goodnesa-of-fit  x^-  9-19 


Table  7.5a  Wide  Selection 


g 


1 

Segment  1 

TCLS 

2 

3 

Rov:  0.609 
Col.  (1):  -0.965 
Col.  (2):  -0.365 

Goodness-of-flt  x  • 

10 
(14) 

19 
(19) 

27 

(24) 

(0.222) 
(0.294) 
(0.238) 

6 
(2) 

11 
(11) 

15 
(18) 

5.35 

Table  7.5b  Wide  Selection 


g 


1 

Segment  2 

TCLS 

2 

3 

■H 

35 
(39) 

37 
(34) 

33 
(32) 

«S 

11 
(7) 

11 
(14) 

10 
(11) 

Row:  1.1882  (0.202) 

Col.  (1):  0.0670  (0.212) 

Col.  (2):  0.1100  (0.210) 

Goodness-of-fit  yi}:  2.85 


'KFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
^Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*X^  value  significant  at  52  level 
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Tables  7.6a  -  7.7b  Functional  Destination  Choice  Predictions  by  Market  Segments'* 


Table  7, 

,6a  Uncongested  Roads**'' 

1 

Segment  1 
TCLS 
2 

3 

Parameters 

Row:   0.942 
Col.  (1):  -0.890 
Col.  (2):  -0.033 

Std. 
Deviation 

'*       30 
(36) 

s  — 

43 
(42) 

45 
(40) 

(0.174) 
(0.199) 
(0.180) 

-  5^) 

17 
(18) 

17 
(22) 

Coodnes8-o£-flt  x  •  6. 

.24* 

Table  7.6b  Uncongested  Roads 


g 


1 

Segment  2 
TCLS 
2 

3 

Row:   0.951 
Col.  (1):  -0.140 
Col.  (2):  -0.245 

Goodness-of-fit  x^'- 

w   1* 

(17) 

13 
(11) 

17 
(16) 

(0.285) 
(0.306) 
(0.314) 

«    6 
(3) 

5 

(7) 

6 
(7) 

2.96 

Table  7.7a  Several  Stores 


1 

Segment  1 
TCLS 
2 

3 

Row:   0.665 
Col.  (1):  -0.383 
Col.  (2):  -0.318 

Goodnesa-of-fit  x  • 

^  10 
(12) 

11 
(10) 

14 
(13) 

(0.290) 
(0.334) 
(0.328) 

c*   5 
(3) 

5 

(6) 

8 
(9) 

1.97 

Table  7.7b  Several  Stores 


g 


1 

Segment  2 
TCLS 
2 

3 

Row:   1.038 
Col.  (1):  -0.293 
Col.  (2):  -0.016 

Goodness-of-fit  x^- 

^  35 
(41) 

46 
(43) 

46 
(43) 

(0.173) 
(0.192) 
(0.179) 

12 
^  (6) 

16 
(19) 

17 
(20) 

6.66* 

•kFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
''Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*X  value  significance  at  52  level 


rabies  7.8a  -  7.9b  Functional  Destination  Choice  Predictions  by  Market  Segments^'^ 
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Table  7.8a  Qulclc  Parking 

Segment  1 
TCLS 
1 2 


g 


25 

28 

32 

(30) 

(25) 

(31) 

10 

12 

13 

(5) 

(10) 

(U) 

Paraneters 


Col. 
Col. 


How:  0.899 
(1):  -0.273 
(2):  -0.140 


Std. 
Deviation 

(0.200) 
(0.224) 
(0.216) 


Goodneas-of-flt  X^:  5.84 


Table  7.8b  Quick  Parking 


1 

Segaent  2 
TCLS 
2 

3 

Row:   0.998 
Col.  (1):  -0.368 
Col.  (2):  -0.026 

Goodnesa-of-flt  X^: 

20 
(23) 

28 
(28) 

28 
(25) 

(0.221) 
(0.250) 
(0.226) 

7 
(4) 

10 
(10) 

11 
(14) 

3.78 

Table  7.9a  Close  to  Hoae 


1 

Segment 
TCLS 
2 

1 

3 

Sow:   0.860 
Col.  (1):  -0.111 
Col.  (2):  0.026 

Goodnesa-of-flt  x^' 

'.' 

M      24 
(28) 

27 
(25) 

27 
(25) 

(0.208) 
(0.238) 
(0.228) 

CM     10 

"   (6) 

12 
(14) 

11 
(13) 

3.67 

Table  7.9b  Close  to  Home 


Segment  2 
TCLS 


1 

2 

3 

H   21 
(25) 

29 
(28) 

35 
(31) 

CM     7 

(3) 

10 
(11) 

12 
(16) 

Row:  1.080  ■  (0.212) 

Col.  (1):  -0.518  (0.238) 

Col.  (2):  -0.187  (0.216) 

Goodnesa-of-flt  x  •  5.65 


■kFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
''Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*x2  value  significance  at  5Z  level 


Tables  7.10a  -  7.11b  Functional  Destination  Choice  Predictions  by  Market  Segments 
1,1  Interaction  Included"*'' 


Tables  7.10a  Many  Employees 
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i 


Segment  1 
TCLS 
1 2 


7 

12 

18 

(7) 

(11) 

(19) 

3 

7 

10 

(3) 

(8) 

(9) 

Std. 

Parameters 

Deviation 

Row:  0.661 

(0.306) 

Col.  (1):  -1.077 

(0.396) 

Col.  (2):  -0.388 

(0.296) 

1.1  Inter-  0.279 

(0.751) 

action 

Goodnes8-o£-£lt  x^'  0.42 


Table  7.10b   Many  Employees 


g 


Segment  2 
TCLS 


1 

2 

3 

(46) 

40 
(42) 

39 
(37) 

"   <J> 

19 
(17) 

18 
(20) 

Sow:   1.185 

(0.196) 

Col.  (1):  -0.469 

(0.266) 

Col.  (2):  0.034 

(0.186) 

1,1  Inter-  1.278 

(0.477) 

action 

Goodness-of-flt  X^-  0-53 


Table  7.11a   Wide  Selection 


Segment  1 
TCLS 


1 

2 

3 

w   1* 

(14) 

18 
(19) 

25 
(24) 

"   (2) 

12 
(U) 

17 
(18) 

Row:  0.911 

(0.298) 

Col.  (1):  -1.359 

(0.408) 

Col.  (2):  -0.336 

(0.239) 

1,1  Inter-  1.552 

(0.791) 

action 

Goodness-of-flt  x^'  0-28 


Table  7.11b   Wide  Selection 


Segment  2 
TCLS 


1 

2 

3 

^   39 
(39) 

35 
(34) 

31 
(32) 

«>.   7 
(7) 

13 
(14) 

12 
(11) 

Row: 

1.220 

(0.208) 

Col.  (1): 

-0.150 

(0.261) 

Col.  (2): 

0.110 

(0.210) 

1,1  Inter- 

0.747 

(0.472) 

action 

Goodness-of-flt  x^^  0-15 

*KFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
''Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*X^  value  significance  at  5Z  level 


Tables  7.12a  -  7.13b  Functional  Destination  Choice  Predictions  by  Market  Segments 
1,1  Interaction  Included^>'> 


Table  7.12a.  Uncongested  Roads 
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g 


Segment  1 
TCLS 


1 

2 

3 

36 
(36) 

40 
(42) 

42 
(40) 

6 
(6) 

20 
(18) 

20 
(22) 

Std. 

Parameters 

Deviation 

Row:  1.076 

(0.195) 

Col.  (1):  -0.683 

(0.256) 

Col.  (2):  -0.033 

(0.181) 

1,1  Inter-  1.074 

(0.430) 

action 

Goodness-of-fit  X^^ 

0.42 

Table  7.12b  Uncongested  Roads 


i 


Segment  2 
TCLS 


1 

2 

3 

17 
(17) 

12 
(11) 

15 
(16) 

3 
(3) 

6 
(7) 

8 
(7) 

Row:  1.016 

(0.302) 

Col.  (1):  -0.424 

(0.379) 

Col.  (2):  -0.245 

(0.314) 

1,1  Inter-  1.078 

(0.706) 

action 

Goodness-of-fit  X^- 

0.32 

Table  7.13a  Several  Stores 


s 


1 

Segment  1 
TCLS 
2 

3 

Row:  0.747 
Col.  (1):  -0.584 
Col.  (2):  -0.318 
1,1  Inter-  0.959 
action         _ 
Goodness-of-fit  x  '• 

^    12 
(12) 

10 
(10) 

13 
(13) 

(0.308) 
(0.389) 
(0.328) 

3 
'^    (3) 

6 
(6) 

9 
(9) 

(0.724) 
0.05 

Table  7.13b  Several  Stores 


Segment  2 
TCLS 


1 

2 

3 

41 
(41) 

43 
(43) 

43 
(43) 

6 

(6) 

19 
(19) 

20 
(20) 

Row;  1.168  (0.194) 
Col.  (1):  -0.621  (0.254) 
Col.  (2):  -0.016  (0.179) 
1,1  Inter-  1.131      (0.477) 

action         . 
Goodness-of-fit  x  •  0-02 


*KFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

In  Figure  7.1 
'*Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 

X^  value  significance  at  5Z  level 


Tables  7.14a  -  7.15b  Functional  Destination  Choice  Predictions  by  Market  Segments 
1,1  Interaction  Included^''' 


Table  7.14a  Quick  Parking 
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g 


Segment  1 
TCLS 


1 

2 

3 

30 
(30) 

26 
(25) 

30 
(31) 

5 
(5) 

14 
(15) 

16 
(15) 

Std. 

Parameters 

Deviation 

Row:   1.013 

(0.220) 

Col.  (1):  -0.582 

(0.284) 

Col.  (2):  -0.140 

(0.216) 

1,1  Inter- 

action   1.168 

(0.168) 

Goodness-of-fit  x  •  0-23 


Table  7.14b  Quick  Parking 


Segment  2 
TCLS 
1 2 3 


'^  (23) 

26 
(28) 

27 
(25) 

-    <t, 

12 
(10) 

12 
(14) 

Row:   1.111  (0.244) 
Col.  (1):  -0.633  (0.319) 
Col.  (2):  -0.026  (0.228) 
1,1  Inter- 
action   0.957  (0.594) 

Goodness-of-fit  x^=  0-83 


Table  7.15a   Close  to  Home 


g 


1 

Segment 
TCLS 
2 

2 

3 

Row:  0.924 
Col.  (1):  -0.336 
Col.  (2):   0.026 
1,1  Inter- 
action   0.924 

Goodness-of-fit  x^'- 

Bow:   1.228 
Col.  (1):  -0.924 
Col.  (2):  -0.187 
1,1  Inter- 
action   1.339 

Goodness-of-fit  x^- 

^   28 
(28) 

25 
(25) 

25 
(25) 

(0.218) 
(0.280) 
(0.228) 

6 
"   (6) 

14 
(14) 

13 
(13) 

(0.509) 
0.02 

Table 

1 

7.15b   Close 

Segment  2 
TCLS 
2 

to  Home 
3 

^   25 
(25) 

27 
(28) 

32 
(31) 

(0.256) 
(0.340) 
(0.216) 

c    3 
(3) 

12 
(11) 

IS 
(16) 

(0.652) 
0.24 

*KFL  and  TCLS  are  retail  floor  area  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
''Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*x2  value  significance  at  5Z  level 


169 

column  parameteis  by  Tl  and  T2.  Each  parameter  for  a  way  of  classi- 
fication is  examined  relative  to  the  last  category.  KFL  categories 
1  and  2  represent  high  and  low  to  medium  retail  floor  area  respective- 
ly. TCLS  categories  1,  2  and  3  represent  high,  medium  and  low  line 
haul  travel  time  ranges  respectively.  The  parameter  F  corresponds  to 
the  effect  of  the  high  floor  area  category  relative  to  the  low  floor 
area  class.  Tl  and  T2  represent  the  effects  of  high  and  medium  line 
haul  travel  time  ranges  compared  with  the  low  time  category. 

A  positive  F  parameter  implies  that  the  probability  of  a  function- 
al destination  choice  increases  with  increasing  retail  floor  area. 
A  negative  value  for  either  of  the  parameters  Tl  and  T2  means  that  the 
probability  of  a  functional  destination  choice  decreases  with  increas- 
ing travel  time.  It  is  noticed  that  the  parameters  F  and  T,  and  T^ 
have  the  appropriate  signs  in  all  the  models  estimated  in  Tables  7  4a 

-  7.9b  for  the  main  effects  models.  The  same  observation  applies  to 
the  models  that  include  a  1,1  interaction  effect  as  shown  in  Tables 
7.10a  -  7,15b. 

The  Effectiveness  of  Attitudinal  Orientation  for  Market  Segmentation 

One  of  the  questions  addressed  was  whether  the  attitudinal 
orientation  of  groups  of  individuals  causes  them  to  behave  differently 
in  their  choice  of  functional  shopping  destinations.   In  Tables  7.4a 

-  7.9b  and  Tables  7.10a  -  7.15b  segment  1  for  an  attitudinal  factor 
indicates  insignificance  of  the  attitudinal  descriptor  to  the  group 
of  individuals  and  segment  2  indicates  strong  importance  of  the 
descriptor.  Assuming  that  each  multinomial  response  relation  model 
fits  the  observed  response  pattern  well  for  each  attitudinal  market 
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segment,  a  statistical  test  of  whether  the  parameter  vectors  for  the 
two  market  segments  are  different  can  establish  whether  the  market 
segment  responses  to  functional  destinations  are  different.  Such 
tests  for  attitudinal  factors  "Wide  Selection,"  "Quick  Parking"  and 
"Close  to  Home"  in  Tables  7.5,  7.8  and  7.9  are  discussed  in  detail 
at  the  end  of  this  chapter.  All  of  them  except"Quick  Parking"  indicate 
a  significant  difference  in  the  response  patterns  for  the  market  seg- 
ments in  each  case.  For  attitudinal  factors  "Many  Employees,"  "Un- 
congested  Roads"  and  "Several  Stores,"  in  Tables  7.4,  7.6  and 
7,7  respectively,  a  main  effects  model  adequately  fits  the  observed 
response  pattern  for  one  market  segment  while  an  interaction  model  is 
needed  for  good  prediction  for  the  other  market  segment.  This  basic 
difference  in  the  models  for  the  two  attitudinal  market  segments  is 
sufficient  basis  for  their  response  patterns  to  differ.  A  comparison 
of  the  parameters  estimated  conveys  this  message  clearly. 

That  there  is  statistical  difference  in  the  response  patterns  for 
two  attitudinal  market  segments  is  not  sufficient  reason  to  consider 
the  attitudinal  segmentation  worthwhile.  One  would  also  like  to  know 
if  the  direction  of  change  of  the  parameters  for  the  two  market  seg- 
ments is  as  expected  for  each  attitudinal  descriptor.   In  Tables  7.4a 
and  7.4b  for  "Many  Employees"  it  is  noticed  that  the  F  parameter  for 
segment  2  is  about  sixty  percent  greater  than  that  for  segment  1 
which  indicates  that  those  who  like  to  shop  where  there  are  many 
employees  also  like  to  shop  where  the  retail  floor  area  is  greater  - 
or  where  the  retail  activity  is  more  intensive.  It  is  also  noticed 
that  the  parameters  Tl  and  T2  are  of  a  greater  magnitude  for  segment  1 
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than  for  segment  2.  This  implies  that  those  who  consider  having  many- 
employees  at  their  shopping  areas  do  not  consider  travel  time  as 
important  in  satisfying  their  need. 

The  remarks  made  above  for  "Many  Employees"  apply  equally  well 
to  the  attitudinal  description  "Wide  Selection"  as  can  be  observed 
in  Tables  7.5a  and  7.5b.   It  is  noticed  that  the  two  attitudinal 
factors  essentially  have  the  same  effect  in  market  segmentation  for 
predicting  functional  destination  choice. 

Considering  the  attitudinal  factors  "Uncongested  Roads'^  for 
trfiich  model  results  are  summarized  in  Tables  7.6a  and  7.6b, it  is 
noticed  that  the  effect  of  shopping  area  size  does  not  differ  signi- 
ficantly for  the  two  market  segments.  The  interpretation  given  to 
"Uncongested  Roads"  by  the  respondents  is  not  clear.  Assuming  a 
strong  preference  for  uncongested  roads  implies  a  greater  affinity 
to  freeways  or  expressways,  one  can  explain  the  difference  in  the  Tl 
and  T2  parameters  for  the  two  market  segments.  Since  individuals  in 
segment  1  travel  more  on  local  roads  they  are  more  sensitive  to  the 
distance  travelled  and  hence  travel  time,  since  the  travel  speed  is 
usually  less   on  lower  type  facilities.  The  lower  magnitude  of  the 
11  and  T2  parameters  for  segment  2  indicates  that  with  a  higher  travel 
speed  on  higher  type  facilities  like  freeways,  travel  time  does  not 
become  as  important  any  longer. 

The  remarks  made  above  for  the  attitudinal  factor  "Many  Employees" 
apply  in  principle  to  "Several  Stores"^  the  results  for  which  are  in 
Tables  7.7a  and  7.7b.   It  is  noticed  that  the  Tl  parameter  is  not 
much  smaller  in  magnitude  for  segment  2  compared  with  segment  1. 
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A  likely  reason  is  that  those  who  prefer  several  stores  consider 
travel  time  (or  indirectly  distance)  as  a  controlling  factor.  It  is 
possible  that  several  clusters  or  small  units  of  retail  outlets  devel- 
op around  downtovm  or  central  areas.  The  travel  time  towards  the 
central  area  of  a  city  is  a  formidable  consideration  even  for  those 
who  expect  to  satisfy  their  need  for  several  retail  outlets. 

Tables  7.8a  and  7.8b  show  that  there  is  no  essential  difference 
between  the  response  patterns  for  market  segments  1  and  2  for  atti- 
tudinal  factor  "Quick  Parking."  This  is  mainly  for  the  reason  that 
only  shopping  trips  to  major  shopping  areas  were  considered  for 
which  parking  was  not  a  real  problem  for  any  of  them. 

The  difference  between  individuals  who  do  and  those  who  do  not 
feel  strongly  about  shopping  close  to  home  is  clearly  borne  out  in 
Tables  7.9a  and  7.9b.  It  is  clear  that  the  line  haul  travel  time 
parameters  Tl  and  T2  are  of  a  much  greater  magnitude  for  segment  2 
that  feels  strongly  about  the  description  "Close  to  Home,"  just  as 
expected. 

The  above  discussion  on  whether  the  functional  destination 
response  patterns  implied  for  different  attitudinal  market  segments 
conform  with  what  one  would  normally  expect  in  practice  is  essenti- 
ally applicable  to  the  models  that  include  a  1,1  interaction  summar- 
ized in  Tables  7.10a  -  7.15b.  The  interaction  models  only  help 
bring  out  expected  patterns  that  the  main  effects  models  could  not 
adequately  portray  because  of  the  poor  model  fit  without  the  inter- 
action consideration. 
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Prediction  of  Functional  Shopping  Destination  Choices  Defined  With 
Respect  to  Retail  Floor  Area  and  Airline  Distance  by  Attitudinal 
Factor  Market  Segments 

The  results  of  a  predictive  model  for  the  total  sample  that  pre- 
dicts frequencies  of  choice  of  functional  shopping  destinations  defined 
with  respect  to  Retail  Floor  Area  and  Airline  Distance  are  given  in 
Table  7.2.   It  is  noticed  that  the  predictive  accuracy  of  the  model 
is  very  good.  To  determine  whether  market  segmentation  according  to 
attitudinal  factors  produces  different  patterns  of  functional  shopping 
destination  choice,  models  were  fitted  for  such  market  segment.  The 
goodness-of-fit  X  values  are  presented  in  Table  7.16  for  these  main 
effects  models.  Tables  7.17a  to  7.22b  present  the  observed  and 
predicted  cell  frequencies  as  well  as  the  model  parameters. 

Except  for  the  cells  of  low  observed  cell  frequencies,  the  per- 
centage of  correct  cell  frequency  prediction  is  very  good  for  all 

the  models.  It  was  not  necessary  to  fit  any  interaction  effect  since 

2 
the  goodness-of-fit  X  values  are  not  significant  at  the  5  percent 

level.  Differences  exist  between  sets  of  parameters  estimated  for 

the  total  sample  model  in  Table  7.2  and  for  the  models  for  market 

segments  summarized  in  Tables  7.17a  to  7.22b.  There  is  reason 

to  believe  that  the  market  segmentation  is  necessary  at 

an  operational  level.  It  is  clear  that  functional  shopping 

destinations  defined  by  Retail  Floor  Area  and  Airline  Distance  are 

predicted  with  greater  stability  and  accuracy  for  both  the  total 

sample  and  attitudinal  market  segments  than  functional  shopping 
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Table  7.16  Likelihood  Ratio  Goodness-of-Fit  x   Values  for  Fitting  KFL 
X  DCLSl  Contingency  Tables. 


Destination  Choice  Attitudinal 
Factor  Used  for  Market  Segmentation 


Main  Effects^'^ 
Model 

Market  Segment 

1        2 


Many  Employees 
Wide  Selection 
Uncongested  Roads 
Several  Stores 
Quick  Parking 
Close  to  Home 


0.40 

4.46 

0.60 

2.87 

4.65 

0.28 

0.77 

3.00 

1.56 

2.13 

3.08 

3.29 

^KFL  and  DCLS  refer  to  Retail  Floor  Area  and  Airline  Distance 
Categories  defined  in  Figure  7.2. 

2  ? 

Each  Likelihood  Ratio  Goodness  of  fit  x  value  has  2d.f. 


3  2 

None  of  the  x  values  is  significant  at  the  0.05  significance  level. 
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Tables  7.17a  -  7.18b  Functional  Destination  Choice  Predictions  by  Market  Segments 


a,b 


Table 

1 

7.17a  Many  Employees 

Segment  1 
DCLS 

2        3 

Parameters 

Row:   0.615 
Col.  (1):  -1.946 
Col.  (2):  -1.540 

Goodness-of-flt  f}:   0. 

Std. 
Deviation 

CM 

4 
(4) 

6 

(5) 

27 
(28) 

(0.278) 
(0.436) 
(0.367) 

2 
(2) 

3 
(4) 

15 
(14) 

,40 

Table  7.17b  Many  Employees 


g 


1 

Segment  2 
DCLS 
2 

3 

Row:   1.067 
Col.  (1):  -1.173 
Col.  (2):  -0.370 

Goodness-of-flt  x^- 

19 
(23) 

43 
(44) 

62 
(58) 

(0.177) 
(0.224) 
(0.170) 

7 
(3) 

15 
(14) 

22 
(26) 

4.46 

Table  7.18a  Wide  Selection 


g 


1 

Segment  1 
DCLS 
2 

3 

Row:   0.609 
Col.  (1):  -2.216 
Col.  (2):  -0.712 

2 
Goodness-of-fit  x  '• 

4 
(4) 

18 
(19) 

36 
(34) 

(0.222) 
(0.427) 
(0.235) 

2 
(2) 

9 

(8) 

19 
(21) 

0.60 

Table  7.18b  Wide  Selection 


g 


1 

Segment  2 
DCLS 
2 

3 

Row:   1.188 
Col.  (1):  -1.005 
Col.  (2):  -0.574 

Goodness-of-fit  x^= 

iH 

20 
(23) 

31 
(30) 

54 
(52) 

(0.202) 
(0.228) 
(0.198) 

M 

6 
(3) 

9 
(10) 

17 
(19) 

2.87 

*KFL  and  DCLS  are  retail  floor  area  and  airline  distance  categories  defined  in 

figure  7.  2 
"Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*x2  value  significant  at  5%  level 
**X  value  significant  at  12  level 
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Tables  7.19a  -  7.20b  Functional  Destination  Choice  Predictions  by  Market  Segments' 


a,b 


Table  7.19a  Uncongested  Roada 


Segment  1 
DCLS 
1 2 


14 

34 

70 

(18) 

(34) 

(66) 

6 

13 

27 

(2) 

(13) 

(31) 

Parameters 


Row:  0.942 
Col.  (1):  -1.579 
Col.  (2):  -0.725 


Std. 

Deviation 

(0.296) 
(0.246) 
(0.178) 


Goodness-of-flt  x^'  4.65 


Table  7.19b  Uncongested  Roads 


g 


Segment  2 
DCLS 
1 2 


9 

14 

21 

(9) 

(15) 

(20) 

3 

6 

8 

(3) 

(5) 

(9) 

Row:  0.951  (0.286) 
Col.  (1):  -0.882  (0.342) 
Col.  (2):  -0.372       (0.290) 

Goodness-of-fit  x^-  0-28 


Table  7.20a  Several  Stores 


i 


1 

Segment  1 
DCLS 
2 

3 

Row:     0.665 
Col.    (1):   -1.268 
Col.    (2):   -0.981 

Goodness-of-fit  x  • 

^ 

6 
(7) 

8 
(8) 

21 
(20) 

(0.290) 
(0.376) 
(0.338) 

CM 

3 
(2) 

4 
(4) 

U 
(12) 

0.77 

Table  7.20b  Several  Stores 


i 


1 

Segment 
DCLS 
2 

2 

3 

Row:     1.038 
Col.    (1):   -1.408 
Col.    (2):   -0.536 

Goodness-of-flt  x^- 

•■t 

17 
(20) 

41 
(41) 

69 
(66) 

(0.173) 
(0.233) 
(0.169) 

CM 

6 
(3) 

14 
(14) 

25 
(28) 

3.00 

*KFL  and  DCLS  are  retail  floor  area  and  airline  distance  categories  defined  in 

Figure  7  .2 
l>Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*x2  value  significant  at  5%  level 
**X^  value  significant  at  12  level 
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Table  7.21a  Quick  Parking 
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i 


Segaenc  1 
DCLS 
1        2 


12 

26 

48 

(14) 

(26) 

(46) 

5 

10 

20 

(3) 

(10) 

(22) 

Row 
Col.  (1) 
Col.  (2) 


Paraaetera 

0.899 
-1.386 
-0.636 


Std. 
Deviation 

(0.200) 
(0.270) 
(0.206) 


Goodness-of-flt  x  •  1.56 


Table  7.21b  Quick  Parking 


g 


1 

SegMBt  2 
DCLS 
2 

3 

Row:     0.998 
Col.    (1):   -1.352 
Col.    (2):   -0.626 

Goodness-of-fit  x^= 

-<s, 

23 
(23) 

42 
(40) 

(0.221) 
(0.290) 
(0.222) 

~  <;, 

8 
(8) 

16 
(18) 

2.13 

Table  7.22a  Close  to  Hoae 


g 


1 

Segnent  1 
DCLS 
2 

3 

Sow:      0.860 
Col.    (1):   -1.065 
Col.    (2):   -0.564 

Goodneas-of-fit  x   • 

-a;, 

23 
(21) 

41 
(40) 

(0.208) 
(0.259) 
(0.218) 

"  <^> 

10 
(12) 

17 
(18) 

3.08 

Table  7.22b  Close  to  Hone 


Segment  2 
DCLS 
1        2 3 


9 

25 

50 

(10) 

(28) 

(46) 

3 

9 

18 

(2) 

(6) 

(22) 

Col. 
Col. 


Row:  1.030 
(1):  -1.735 
(2):  -0.693 


(0.212) 
(0.296) 
(0.210) 


Goodness-of-flt  x^'  3.29 


"KFL  and  DCLS  are  retail  floor  area  and  airline  distance  categories  defined  in 

Figure  7.2 
*'Cgll  numbers  are  predicted  frequencies  %rf.th  observed  frequencies  in  parenthesis 
*¥  value  significant  at  5Z  level 
*  x^  value  significant  at  IZ  level 
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destinations  defined  by  Retail  Floor  Area  and  Line  Haul  Travel  Time. 
This  is  a  welcome  trend  that  must  be  further  investigated  since  it 
could  make  the  application  of  the  methodology  of  multinomial  response 
relations  to  predicting  functional  shopping  destinations  much  more 
practical.  The  reason  is  that  airline  distances  are  easier  to  compute 
than  travel  times. 

The  definition  of  airline  distance  range  categories  is  equiv- 
alent to  that  for  line  haul  travel  time  in  that  an  increasing  cate- 
gory class  number  implies  a  decreasing  airline  distance.  Tlie  effects 
for  the  Retail  Floor  Area  by  Airline  Distance  models  fitted  in 
Tables  7.17a  to  7.23b  are  also  relative  to  the  last  category.  All 
the  remarks  in  connection  with  model  interpretation,  the  need  for 
attitudinal  market  segmentation  and  the  consistency  in  the  response 
patterns  implied  by  the  models  in  connection  with  the  Retail  Floor 
Area  by  Line  Haul  Travel  Time  functional  destination  models  in  Tables 
7,4a  to  7.15b  apply  essentially  to  the  models  for  Retail  Floor  Area 
by  Airline  Distance  functional  destinations  summarized  in  Tables  7.17a 
to  7.23b,  That  the  Airline  Distance  models  show  good  promise  has 
been  discussed  above.  Statistical  tests  to  establish  the  need  for 
the  market  segmentation  are  discussed  in  detail  at  the  end  of  this 
chapter. 

Prediction  of  Functional  Shopping  Destination  Choices  Defined  for 
Approachability  Market  Segments 

Approachability  market  segments  1,  2,  3  refer  to  poor,  medium 
and  good  approachability  to  a  shopping  destination  from  a  given  direc- 
tion, as  defined  in  Chapter  4.  The  cell  frequencies  of  some  of  the 
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functional  classification  tables  are  so  low  that  the  results  being 
discussed  can  only  be  for  illustration.  The  prediction  cell  fre- 
quencies and  main  effects  model  parameters  are  shown  in  Tables  7.23a 
to  7.23c  for  Retail  Floor  Area  by  Line  Haul  Travel  Time  functional 
shopping  destinations  and  in  Tables  7.24a  to  7.24c  for  Retail  Floor 
Area  by  Airline  Distance  functional  shopping  destination  categories. 
It  is  not  surprising  that  most  of  the  models  are  not  good  enough 
since  the  approachability  measure  was  found  not  to  have  any  effect 
on  destination  choices  in  Chapter  4.  In  response  to  direct  questions 
on  why  they  chose  certain  destinations  in  Kannel's  survey  [10],  the 
respondents  rarely  stated  directly  a  reason  that  indicated  a  factor 
related  to  the  quality  of  the  transportation  system  or  traffic  flow. 
This  situation  arises  because  of  the  special  nature  of  the  sample 
selected  or  the  nature  of  the  transportation  network  in  the  Indiana- 
polis area  relative  to  shopping  area  locations. 

The  Feasibility  of  Having  Attitudinal  and  Approachability  Factors  in 
the  Same  Model  for  Functional  Shopping  Destination  Prediction 

The  small  sample  size  available  did  not  permit  any  partioning 
into  four  subsamples.  It  was  therefore  not  possible  to  consider  any 
market  segmentation  derived  from  a  2-way  classification  using  an 
attitudinal  factor  and  approachability.  It  is  still  instructive  to 
determine  if  attitudinal  factors  and  approachability  act  independently, 

in  which  case  attitudinal  and  transportation  network  level-of -service 
considerations  can  be  incorporated  in  the  same  model.  With  attitudi- 
nal factors  in  two  categories  defined  and  approachability  in  three 
categories.  Tables  7.25  to  7.31  show  results  of  attempts  to 
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Tables  7.23a  -  7.23c  Functional  Destination  Choice  Predictions  by  Market  Segments*'^ 


Table  7.23a  Approachablllty 

Segment  1 

TCLS  Std. 

_1 2 3  Parameters   Deviation 

Sow:   2.398      (0.464) 

Col.  (1):  -0.314       (0.302) 

Col.  (2):  -0.550      (0.324) 


g 


17 

14 

24 

(18) 

(15) 

(22) 

2 

1 

2 

(1) 

(0) 

(4) 

Goodness-of-flt  x^:  4.26*  (1  d.f.) 


Table  7.23b  Approachablllty 

Segment  2 
TCLS 


i 


1 

2 

3 

w   17 
(22) 

26 

(28) 

31 
(26) 

7 
'^   (2) 

11 
(9) 

13 
(19) 

Row; 

0.916 

(0.216) 

Col.  (1): 

-0.606 

(0.253) 

Col.  (2): 

-0.173 

(0.223) 

Goodnes8-o£-flt  x^^  10.64**  (2  d.f.) 


Table  7.23c  Approachablllty 

Segment  3 
TCLS 


i 


1 

2 

3 

H   8 
(13) 

15 
(10) 

9 
(9) 

6 
''   (1) 

10 
(15) 

6 
(6) 

Row: 

0.374 

(0.276) 

Col. 

(1): 

-0.069 

(0.370) 

Col. 

(2): 

0.511 

(0.326) 

Goodness-of-flt  x^:  11.95**  (2  d.f.) 


*KFL  and  TCLS  are  retail  floor  are  and  line  haul  travel  time  categories  defined 

in  Figure  7.1 
*'Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*y2  value  significant  at  5%  level 
*  X^  value  significant  at  IZ  level 
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Tables  7.24a  -  7.24c  Functional  Destination  Choice  Predictions  by  Market  Segments**'* 


Table  7.24a  Approachablllty 

Segment  1 

DCLS  Std. 

_1 2 3  Parameters   Deviation 

Row:     2.400  (0.465) 

Col.  (1):     1.792  (0.480) 

Col.  (2):     1.609  (0.488) 


g 


28 

23 

5 

(30) 

(21) 

(4) 

Z 

2 

1 

(0) 

(4) 

(1) 

Goodneas-of-flt  x^:  7.43*  (1  d.f.) 


Table  7.24b  Approachablllty 

Segment  2 
DCLS 


g 


1 

2 

3 

r4 

29 
(35) 

36 
(31) 

9 
(9) 

C« 

12 
(6) 

15 
(20) 

4 
(4) 

Row: 

0.916 

(0.216) 

Col.    (1): 

1.149 

(0.320) 

Col.    (2): 

1.367 

(0.311) 

Goodness-of-flt  x^:  7.14*  (2  d.f.) 


Table  7.24c  Approachablllty 


g 


N 


Segment  3 

DCLS 

1 

2 

3 

15 

15 

1 

(19) 

(12) 

(1) 

11 

11 

1 

(7) 

(14) 

(1) 

Row:     0.375  (0.276) 

Col.    (1):      2.565  (0.731) 

Col.    (2):     2.565  (0.731) 

Goodness-of-fit  x^:  4.05  (  2  d.f.) 


•kFL  and  DCLS  are  retail  floor  area  and  airline  distance  categories  defined  in 

Figure  7.2 
''Cell  nuaibers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*y2   value  significant  at  5%  level 
**X^  value  significant  at  12  level 


Table  7.25  Prediction  of  the  Joint  Relationship  Between  "Many  Employees"  (SCI)  and 
Approachability  (AP)* 
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AP 

2 


■H 

15 
(12) 

26 
(35) 

13 
(7) 

U 

cn 

CM 

45 
(48) 

79 
(70) 

41 
(47) 

Parameters 


Col. 
Col. 


Row:  -1.117 
(1):  0.105 
(2):  0.665 


Std. 

Deviation 

(0.156) 
(0.188) 
(0.167) 


Goodness-of-fit  x^:  9.27* 


Table  7.26  Prediction  of  Joint  Relationship  Between  "Wide  Selection"  (SDl)  and 
Approachability  (AP)* 


AP 

1 

2 

3 

Row: 

-0.455 

(0. 

23 

41 

21 

.139) 

■H 

(25) 

(44) 

(16) 

Col. 

(1): 

0.105 

(0. 

.187) 

iH 

Col.    (2): 
Goodness-( 

0.665 
of-fit  x^: 

(0. 
2.61 

.167) 

s 

37 

64 

33 

C4 

(35) 

(61) 

(38) 

Table  7.27  Prediction  of  the  Joint  Relationship  Between  "Shortest  Time"  (SEl)  and 
Approachability  (AP)* 


1 

AP 
2 

3 

Row:   -2.354 
Col.    (1):      0.105 
Col.    (2):      0.665 

Goodness-of-fit  x   '• 

iH 

fH 
H 

5 

(3) 

9 
(12) 

5 
(4) 

(0.239) 
(0.187) 
(0.167) 

CM 

55 
(57) 

96 
(93) 

49 
(50) 

2.23 

Table  7.28  Prediction  of  the  Joint  Relationship  Between  "Uncongested  Roads"  (SHI) 
and  Approachability  (AP)* 


AP 

2 


H   ■ 

CM 

44 
(45) 

77 
(80) 

39 
(35) 

16 
(15) 

28 
(25) 

15 
(19) 

Row: 

0.998 

(0.152) 

Col.    (1): 

0.105 

(0.188) 

Col.    (2): 

0.665 

(0.167) 

Goodness-of-fit  x^^  2.42 


■Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 
*X^  value  significant  at  5Z  level 
**^2  value  significant  at  IZ  level 
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Table  7.29  Prediction  of  the  Joint  Relationship  Between  "Several  Storea"  (SKI) 
and  Approachabllity  (AP)* 


1 

AP 

2 

3 

Parameters 

Row:   -1.192 
Col.    (1):     0.105 
Col.    (2):     0.665 

Goodness-of-flt  x^=   5. 

Std. 

Deviation 

■H 

s  ■ 

CNJ 

14 
(13) 

46 
(47) 

25 
(31) 

81 
(74) 

13 
(7) 

41 
(47) 

(0.160) 
(0.187) 
(0.167) 

,92 

Table  7.30  Prediction  of  the  Joint  Relationship  Between  "Quick  Parking"  (SLl)  and 
Approachabllity  (AP)* 


1 

AP 

2 

3 

^      32 
(26) 

56 
(63) 

29 
(27) 

Row:     0.119                (0.135) 
Col.    (1):     0.105                (0.187) 
Col.    (2):     0.665                (0.167) 

c      28 
"    (34) 

49 
(42) 

25 
(27) 

Goodness-of-flt  x^:   *-53 

Table  7.31  Prediction  of  the  Joint  Relationship  Between  "Close  to  Home"  (ao.)  and 
Approachabllity  (AP)* 


AP 
2 


rH          30 

(28) 

52 
(53) 

27 
(27) 

N      30 
(32) 

53 
(52) 

27 
(27) 

Row: 

-0.027 

(0.135) 

Col.    (1): 

0.105 

(0.187) 

Col.    (2): 

0.665 

(0.167) 

Goodness-of-flt  x^'   0'24 


*Cell  numbers  are  predicted  frequencies  with  observed  frequencies  in  parenthesis 

X^  value  significant  at  5Z  level 
**X^   value  significant  at  12  level 
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reproduce  2-way  contingency  table  cell  frequencies  defined  by  the  two 
types  of  factors.  Except  for  the  factor  of  "Many  Employees"  all 
the  attitudinal  factors  relative  to  shopping  destination  choice  act 
independently  of  the  approachability  measure  and  the  level  of  accuracy 
of  cell  frequency  prediction  is  good.  It  has  been  pointed  out  that 
approachability  is  not  a  significant  factor  in  shopping  destination 
choice  for  the  sample  used  in  this  research.  The  two-category 
attitudinal  factors  are  also  too  gross.  No  indication  is  given, 
however,  that  the  attitudinal  factors  act  in  conflict  with  the 
approachability  measure.  This  expectation  of  independence  can  be 
extended  to  direct  measures  that  the  attitudinal  factors  are  related 
to. 

Testing  the  Hypothesis  That  the  Attitudinal  Market  Segments  Used  in 
Functional  Destination  Choice  Prediction  Represent  Different 
Behavioral  Populations 

It  was  discussed  in  Chapter  5  that  under  maximum  likelihood 
estimation  with  a  large  enough  sample  size,  the  joint  distribution 
of  the  parameters  estimated  is  multivariate  normal.  To  test  the 
equality  of  two  parameter  vectors  one  must  first  test  the  equality  of 
their  covariance  matrices. 

Testing  the  Equality  of  Two  Covariance  Matrices 

Three  multivariate  statistical  testing  criteria  were  used  to 
test  the  equality  of  parameter  covariance  matrices  for  the  attitudinal 
market  segments.  These  are:  1.  Hotelling's  Trace  Criterion 

2.  Pillai's  Trace  Criterion 

3.  Roy's  Largest  Root  Criterion 
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The  three  tests  and  associated  tables  are  discussed  in  Pillai 

[62]  . 

I 

The  hypothesis  that  p  x  p  covariance  matrices  g  and  e„  are 
equal  is  equivalent  to 

H^:  Y^  =  1  i  =  l,...p 
where  the  y- 's  are  the  characteristic  roots  of  T.,    l^   ;  ie.  the  roots 
of 

i 

The  alternate  hypothesis  is 

i  =  1 

Sample  estimates  of  Ij^  and  £„  are  defined  as  S,  and  S  respectively. 
If  the  sample  sizes  involved  in  the  estimation  of  Z,   and  E  are  N, 
and  N  respectively,  the  quantities  M  and  N,  used  for  the  three 
tests,  are  defined  as  follows: 


Ni-p-2         N  -p-2 

M  =  and  N  =  -i 

2  2 

-1 
The  p  characteristic  roots  of  S  S^  are  \.     i  =  l,...p. 

The  roots  of  S^CS^  +  S J"-^  are  e^,  i  =  1  =  l...p  where 


^i 
9^  =  1  =  1,  ...p. 

l^i 
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Hotelling's  Trace  Criterion  uses  the  trace  of  the  matrix  S  $2" 
which  is  given  by 

uCp)  =  ?  a. 

i=l   ^ 

The  hypothesis  that  Z  =  E  is  rejected  at  a  given  significance 
level  if  U*^-'  exceeds  the  tabular  value  [62]  for  p,  M  and  N. 

Pillai's  Trace  Criterion  performs  the  test  of  the  equality  of  I, 
and  E^  using  the  trace  of  the  matrix  S-^CS,  +  S2)"''"  which  is 


vcp)=  f 


e 
1=1 

V^J  is  compared  with  the  tabular  value  [62]  for  p,  M  and  N  at  a 
given  significance  level  and  the  null  hypothesis  rejected  if  V^P^ 
exceeds  the  tabulated  value. 

Roy's  Largest  Root  Criterion  uses  the  largest  characteristic  root 
of  the  matrix  S^CSj^  +  S2)   ,  9  Cnax)  and  compares  it  with  a  tabular 
value  [62]  for  p,  M  and  N  at  a  given  signifance  level.  The  null 
hypothesis  is  rejected  if  6 (max)  is  greater  than  the  tabular  value. 

The  development  of  the  above  test  criteria  was  based  on  covariance 
matrice  estimates  estimated  directly  from  sample  observations.  When 
the  covariance  matrices  are  estimated  from  maximum  likelihood  estim- 
ation, a  correction  must  be  made  for  the  degrees  of  freedom  lost  in 
parameter  estimation  in  the  use  of  N,  and  N-  in  the  computation  of  M 
and  N.  By  the  implication  of  the  orthogonality  of  sets  of  effects 
corresponding  to  ways  of  calssification  in  the  models  used,  the 
covariance  terms  between  orthogonal  effects  are  necessarily  zero. 
On  the  average,  for  the  range  of  paraimeter  vector  sizes  involved  in 
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the  tests  performed  in  this  study,  one  parameter  is  estimated  for 
each  component  of  a  covariance  matrix.  To  correct  for  this  loss  of 
degree  of  freedom  N^^-l  and  N--1  are  used  in  the  place  of  N,  and  N2 
respectively  in  the  computation  of  M  and  N  required  for  performing 
the  hypothesis  testing.  The  sample  sizes  involved  in  the  application 
of  the  tests  in  this  research  are  so  large  that  this  correction 
makes  an  indiscernible  difference. 

Testing  the  Equality  of  Two  Parameter  Vectors 

If  the  hypothesis  that  the  covariance  matrices  of  parameter 
vectors  are  equal,  discussed  above,  is  rejected  then  it  is  obvious 
that  the  two  parameter  vectors  involved  belong  to  two  different 
normal  populations.  With  regard  to  the  problem  at  hand  this  implies 
the  market  segmentation  by  attitudinal  orientation  was  necessary 
since  the  statistical  distributions  of  the  parameter  vectors  of  the 
two  market  segments  involved  being  different  necessarily  implies  that 
the  response  patterns  of  the  two  market  segments  to  functional  des- 
tination choice  are  different. 

A  multivariate  test  will  be  discussed  that  assumes  that  the 

hypothesis  of  the  equality  of  covariance  matrices  is  not  rejected. 

2 
The  statistic  used  is  Hotelling's  T  [63]  .  Denoting  the  parameter 

vectors  for  the  two  market  segments  or  populations  by  x-^   and  x_, 

Hotelling's  statistic  is  given  by 


T 


-?   N1N-7  -1 

N  +N      -1   ~2   ~    ^  ^ 


1   2 
where 


5  =  ?i  *  52 
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where  S  and  S™  are  estimates  of  the  covariance  matrices  of  the  two 
populations.  The  previous  discussion  on  corrections  applied  to  N,  and 
Nj  applies  here  as  well, 


The  statistic  F™  = 


Nj+N^-p-l      j2 


N  +N  -2 
1   2 


hjis  an  F  distribution  with  p  and  N.+N2-P-I  degree  of  freedom.  The 
test  of  the  equality  of  parameter  vectors  is  performed  by  comparing 
Fj  with  the  appropriate  tabulated  value  in  F-tables  at  a  given  signi- 
ficance level. 

Testing  the  Hypothesis  That  two  Attitudinal  Market  Segment  Multinomial 
Response  Relation  Model  Parameter  Vectors  Have  the  Same  Multivariate 
Normal  Distribution 

It  has  been  previously  discussed  that  if  the  multinomial  response 
relation  functional  destination  choice  model  parameter  vectors  for 
two  attitudinal  market  segments  belong  to  the  same  multivariate  normal 
population  then  there  is  no  need  for  the  market  segmentation  with  the 
attitudinal  factor  in  question.  The  preceeding  sections  presented 
appropriate  multivariate  statistical  tests  for  determining  such  a  need- 
for  attitudinal  market  segmentation.  The  results  are  summarized  in 
Table  7.32.  It  is  noticed  that  the  attitude  of  respondents  to  the 
need  for  quick  parking  does  not  affect  their  response  patterns  in 
their  choice  of  functional  destinations.  The  possible  reason  for 
this  has  been  discussed  in  earlier  sections.  All  other  results  in 
Table  7.32  justify  the  attitudinal  market  segmentation  adopted. 
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The  Use  of  Travel  Data  on  Zonal  Shopping  Destination  Choice  to 
Provide  an  Approximate  Verification  of  Some  Assumptions  Used  in 
Functional  Destination  Choice  Modelling 

Working  on  the  assumption  that  shopping  destinations  defined 
along  traffic  zonal  boundaries  may  suffice  for  functional  destination 
choice  models  for  groups  of  travellers,  1964  travel  data  on  shopping 
trips  to  zonal  destinations  were  used  to  roughly  verify  some  basic 
assumptions  that  are  implicit  in  the  functional  shopping  destination 
choice  modelling  discussed  in  this  chapter.  Details  of  the  empirical 
analysis  carried  out  can  be  found  in  Appendix  B. 

TTiat  travel  market  segmentation  is  necessary  to  model  the  differ- 
ent response  patterns  to  functional  shopping  destinations  of  differ- 
ent groups  of  people  is  clearly  established.  In  spite  of  the  defini- 
tion of  shopping  areas  along  traffic  zone  boundaries  being  unrealistic 
in  many  cases  the  impression  given  by  the  models  developed  in  Appendix 
B  is  that  Zonal  Retail  Floor  Area  or  Zonal  Retail  Employment  and 
Inter-zonal  Distance  are  adequate  for  modelling  the  shopping  patterns 
of  higher  income  groups,  with  improved  results  if  local  travel  is 
excluded  from  consideration.  There  was  not  sufficient  information  on 
1964  retail  land-use  for  the  whole  of  Indianapolis  to  enable  the 
identification  of  proper  shopping  area  boundaries.  Consistency  in 
the  response  patterns  for  subsamples  of  different  size  for  the  high 
income  market  segments  should  establish  the  complete  adequacy  of 
using  zonal  destinations  for  these  market  segments.  Statistical 
tests  discussed  in  Appendix  B  do  not  indicate  this  consistency, 
clearly  establishing  the  need  to  use  actual  shopping  area  boundaries 
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in  the  definition  of  functional  shopping  destinations.  The  lack  of 
further  data  on  1964  specific  shopping  areas  did  not  enable  the 
pursuance  of  the  issue  with  the  1964  data.  The  encouraging  results 
with  models  using  actual  shopping  areas,  as  discussed  in  this  chapter, 
strongly  support  the  call  to  discontinue  using  zonal  shopping 
destinations. 

The  empirical  analysis  discussed  in  Appendix  B  indicates  that 
urban  travellers  consider  few  categories  of  each  factor  that  defines 
functional  shopping  destinations.  Three  categories  of  Retail  Floor 
Area,  Retail  Employment  and  interzonal  Airline  Distance  ranges  were 
found  to  be  adequate  for  the  modelling  discussed  in  Appendix  B. 
Greater  numbers  of  classification  categories  did  not  display  any 
statistically  reproducible  patterns  of  functional  destination  choice 
behavior  or  monotonicity  or  regularity  of  the  effects  of  the  factors 
of  functional  destination  classification.  The  need  for  considering 
certain  interactions  in  model  development  is  also  borne  out  in  the 
analyses  discussed  in  Appendix  B. 

Summary 

The  stability  and  accuracy  of  models  that  predict  functional 
shopping  destinations  for  both  the  total  sample  and  market  segments 
are  demonstrated  by  the  empirical  analysis  discussed  in  this  research. 
The  findings  apply  generally  to  functional  shopping  destinations  de- 
fined by  Retail  Floor  Area  and  Line  Haul  Travel  Time  and  more  specifi- 
cally to  those  defined  by  Retail  Floor  Area  and  Airline  Distance.   The 
implication  of  this  in  operational  destination  choice  modelling  is  in- 
valuable since  airline  distances  are  easier  to  compute  than  travel 
times.  It  is  established  that  in  larger  classification  tables 


192 


the  fitting  of  certain  key  interactions  in  the  predictive  model  devel- 
opment greatly  improves  the  accuracy  of  the  results.  Approachability 
is  not  expected  to  play  an  important  role  in  functional  shopping 
destination  choice  modelling  because  of  the  background  of  the  sample 
used,  and  this  was  found  to  be  the  case. 

No  reason  was  found  why  market  segmentation  according  to  attitu- 
dinal  orientation  should  not  be  adopted  or  further  investigated  for 
operational  model  development.  Multivariate  statistical  tests  on 
model  parameters  for  different  attitudinal  market  segments  and  the 
interpretation  of  the  sense  of  the  functional  destination  choice  model 
parameters  strongly  support  patterns  of  shopping  behavior  that  one 
would  expect  in  practice.  The  need  for  attitudinal  market  segmenta- 
tion was  established  for  all  the  factors  considered  except  for  that 
relating  to  the  ability  to  find  a  parking  space  quickly,  a  plausible 
reason  for  that  being  given. 

A  discussion  of  empirical  analyses  on  functional  shopping 
destinations  carried  out  in  Appendix  B  strongly  supports  the  major 
objections  to  the  use  of  zones  to  define  retail  areas  and  the  irregul- 
arity of  the  effects  of  shopping  attraction  variables  like  zonal 
retail  floor  area  and  zonal  retail  employment.  The  view  that  urban 
travellers  usually  consider  a  small  number  of  categories  of  factors 
affecting  their  travel  behavior  is  supported. 

The  limited  sample  size  available  could  not  allow  the 
two-way  classification  of  attitudinal  factors  in  market  segmentation 
or  the  combination  of  attitudinal  orientation  and  approachability 
for  the  same  purpose.  An  investigation  of  the  joint  relationship 
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between  attitudinal  factors  and  approachability  gave  no  indication 
why  attitudinal  and  transportation  level  of  service  considerations 
would  conflict  in  the  same  predictive  model. 

In  spite  of  the  limitations  posed  by  the  small  sample  size 
available,  the  results  of  the  empirical  analysis  in  this  chapter 
strongly  recommend  the  adoption  of  the  multinomial  response  relation 
(MRR)  approach  to  functional  shopping  destination  choice  modelling 
in  further  investigations,  using  a  more  appropriate  data  base  and  for 
subsequent  adoption  in  operational  modelling. 
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CHAPTER  8 
CONCLUSIONS  AND  RECOMMENDATIONS 

Summary 

This  dissertation  presents  the  development  of  a  theoretical 
framework  that  allows  for  a  practical  modelling  of  destination  choice. 
The  spatial  choice  of  destinations  is  selected  in  order  to  highlight 
the  more  immense  problems  posed  by  spatial  choices  in  travel  demand  ana- 
lysis and  to  demonstrate  that  more  specific  travel  choices  can  be 
incorporated  in  the  resulting  modelling  framework  as  special  cases. 

Currently  available  techniques  for  disaggregate  analysis  of 
destination  choice  are  over-generalized  and  do  not  achieve  what  they 
claim  to  accomplish,  statistically.  The  present  formulation  of  the 
current  approaches  also  makes  it  virtually  impossible  for  aggregate 
forecasts  of  destination  choice  behavior  to  be  made. 

The  multinomial  response  relation  methodology  proposed  in  this 
research  enables  a  straightforward  computation  of  aggregate  forecasts 
with  regard  to  the  choice  of  types  of  shopping  destination  choice 
behavior.  This  is  because  the  approach  is  directly  tied  to  the 
sampling  design  used  in  data  collection.  A  major  advantage  of  the 
multinomial  response  relation  approach  is  that  it  is  not  demanding  on 
detailed  variables  since  the  classification  scheme  used  reduces 
continuous  variables  into  intervals.  This  allows  both  discrete 
and  continuous  variables  in  a  single  modelling  framework. 
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This  is  particularly  useful  where  a  detailed  and  sophisticated  data 
base  is  not  available  and  yet  the  best  information  must  be  extracted 
from  currently  available  data  in  order  to  plan  a  more  ambitious  data 
collection  effort.  A  further  advantage  of  the  multinomial  response 
relation  model  is  the  ability  to  include  interaction  effects  in  the 
analysis. 

To  obtain  a  common  choice  set  of  shopping  destination  type  alter- 
natives, a  functional  destination  concept  is  introduced.  This  is, 
basically,  a  classification  of  shopping  areas  with  respect  to  their 
attraction  measures  and  their  geographic  or  time  frajne  location  relative 
to  households  or  subzones  of  travel  origin.  The  two-way  classification 
used  for  functional  shopping  destination  definition  in  this  research 
is  only  a  result  of  the  limited  sample  size  for  the  data  available. 
The  definition  of  travel  market  segments  with  respect  to  socio-economic 
characteristics  or  attitudinal  orientation  becomes  necessary  in  order 
to  identify  the  different  patterns  of  travel  behavior  exhibited  by 
different  market  segments.  Attitudinal  orientation  is  the  means  of 
market  segmentation  examined  in  detail  in  this  research. 

The  number  of  people  in  a  given  travel  market  segment  that  are 
associated  with  a  given  functional  shopping  destination  classifica- 
tion is  predicted  by  the  multinomial  response  relation  model. 
Applying  the  expansion  factor  that  relates  the  sampling  of  the  market 
segment  to  the  total  population,  a  forecast  of  the  aggregate  responses 
to  shopping  behavior  categories  (or  functional  shopping  destinations) 
is  made.  For  a  given  traffic  zone  or  subzone,  one  can  use  census  or 
other  secondary  data  to  identify  the  number  of  households  belonging  to 
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a  particular  travel  market  segment.  The  multinomial  response  rela- 
tion model  predicts  the  proportion  of  the  market  segment  that  chooses 
a  given  functional  shopping  destination  category.  An  example  of  such 
a  functional  destination  category  is  the  set  of  shopping  areas  falling 
within  a  specified  retail  floor  area  range  and  a  distance  interval. 
Since  retail  floor  area  and  distance  are  used  in  defining  functional 
shopping  destinations,  one  needs  further  factors  that  condition  the 
choice  of  specific  shopping  areas  that  fall  within  the  functional 
shopping  destination  category  in  question.   Such  factors  may  include 
the  frequency  of  sales  of  certain  items  in  one  locality  or  the  depend- 
ability of  products  of  certain  stores  in  another.  The  measures  of 
factors  conditioning  the  selection  of  specific  shopping  destinations 
for  a  given  commodity  depend  on  the  type  of  market  segment  and  the 
particular  stores  that  fall  within  the  functional  destination  category 
in  question.   The  measures  also  depend  on  the  geographic  location  of 
the  zone  or  subzone  where  the  market  segment  originated  from  in  "home 
to  shop"  travel.   It  is  clear  that  an  attempt  to  develop  a  uniform 
value  system  for  defining  factors  affecting  the  choice  of  specific 
shopping  areas  introduces  insurmountable  complications.  A  finer 
classification  of  functional  shopping  destinations  provides  a  large 
number  of  alternative  categories.   In  such  an  event,  the  number  of 
specific  destinations  is  likely  to  be  small  enough  for  the  assignment 
of  specific  destinations  to  be  based  on  experience  with  the  patronage 
of  particular  shopping  areas.  At  this  level,  a  simulation  model  can 
be  used  to  assign  specific  destinations  whereby  the  probabilities  of 
specific  destination  choice  are  dependent  on  prevailing  local 
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conditions.  A  simple  license  plate  survey  at  selected  shopping  areas 
can  provide  information  on  the  origin  of  customers  found  at  different 
shopping  areas.  This  information  gives  an  idea  of  the  relative  patron- 
age of  certain  types  of  shopping  areas. 

The  current  belief  that  disaggregate  models  developed  with  tradi- 
tional random  sampling  require  smaller  samples  than  aggregate  models 
is  not  supported  by  the  theoretical  work  in  this  research.  Separate 
sampling  in  the  context  of  stratified  sampling  at  activity  center- 
market  segment  interfaces  is  recommended.  It  is  also  suggested  sample 
sizes  should  be  large  enough  in  order  to  validate  statistical  pro- 
ceedures  associated  with  the  model  development  that  assume  the  avail- 
ability of  large  samples.  Random  sampling  leaves  to  chance  the 
frequency  of  association  observed  for  each  category  of  functional 
shopping  destination.  In  order  to  cover  the  infrequently  chosen 
categories,  a  very  large  sample  may  be  needed.  Separate  sampling 
considers  each  functional  shopping  destination  category  as  a  stratum 
and  samples  each  stratum  as  in  stratified  sampling.  With  separate 
sampling  there  is  no  danger  of  over-representing  certain  categories  of 
functional  shopping  destinations  at  the  expense  of  others. 

The  multinomial  response  relation  approach,  with  a  standardized 
functional  choice  set, can  be  viewed  as  a  discrimination  procedure  if 
further  variables  are  available  for  such  discrimination  above  the 
variables  used  for  defining  functional  destinations.  Travel  decisions 
like  mode  and  time  of  day  choices  have  standardized  alternatives. 
This  enables  a  ready  application  of  a  recently  developed  logistic 
discrimination  approach  with  separate  sampling.   The  logistic 
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discrimination  procedure  uses  the  logistic  distribution  as  against 
classical  techniques  that  use  the  multivariate  normal  distribution. 

Major  Conclusions  and  Recommendations 
Choice  Set  Definition 

Current  disaggregate  modelling  of  destination  choices  rejects  the 
assumption  that  all  individuals  have  the  same  set  of  physical  destina- 
tions and  goes  to  an  extreme  in  specifying  physical  destination  choice 
sets  of  varying  size  for  different  individuals.  The  result  is  that  the 
methodology  cannot  be  adopted  on  a  large  scale.  The  predictive  accur- 
acy of  logit  models  using  arbitrarily  selected  destination  choice  sets 
is  not  satisfactory.   It  is  suggested  that  functional  destinations 
defined  with  the  attraction  attributes  of  shopping  areas  and  individ- 
uals' relative  location  to  shopping  destinations  be  used  in  order  to 
define  a  common  shopping  destination  behavioral  choice  set.   The 
multinomial  response  relation  methodology  postulates  a  linear  model 
that  predicts  the  probability  of  association  with  each  functional 
shopping  destination  for  each  travel  market  segment. 

Aggregation 

Current  generalized  probabilistic  choice  modelling  of  travel 
behavior  does  not  have  any  provision  for  making  aggregate  forecasts 
tied  to  its  theoretical  formulation  or  data  collection  scheme. 
The  multinomial  response  relation  approach  has  a  direct  provision 
for  aggregation.   Only  appropriate  expansion  factors,  derived  from 
the  sampling  framework  used,  need  be  applied  to  predicted  frequency 
responses  to  functional  destination  categories  to  obtain  aggregate 
forecasts. 
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Attitudinal  Orientation 

Few  attitudinal  factors  need  be  considered  in  a  single  model  for 
shopping  travel  prediction.  A  suitable  market  segmentation  by  attitud- 
inal orientation  can  use  only  two  attitudinal  factors  -  one  relating 
to  measures  of  the  wide  range  of  goods  and  services  at  a  shopping  area, 
the  other  to  the  ease  of  getting  to  the  shopping  area  in  terms  of 
distance  or  travel  time.  Even  with  a  two-way  classification,  good 
predictability  of  attitudinal  orientation  patterns  requires  the  consid- 
eration of  certain  interaction  effects. 

Market  segmentation  by  single  attitudinal  factors  is  statistically 
shown  to  be  valid.  A  direct  behavioral  interpretation  of  the  patterns 
of  functional  destination  choice  behavior  for  different  market  segments 
is  possible  in  ill  cases. 

Predictive  Performance  of  the  Conditional  Logit  Model  and  the  Niultino- 
mial  Relation  Response  Relation  Model  for  Actual  and  Functional 
Shopping  Destination  Choices,  Respectively 

All  the  models  developed  using  the  conditional  logit  model  for 
predicting  specific  destination  choices  achieved  less  than  50  percent 
predictive  accuracy,  notwithstanding  other  objections  to  the  use  of 
the  methodology  with  regard  to  choice  set  definition  and  aggregation. 
An  alternative  modelling  framework  in  terms  of  multinomial  response 
relations  is  proposed,  using  functional  shopping  destinations.  Except 
for  a  few  categories  of  low  frequency,  the  empirical  investigation  of 
the  methodology  gives  very  good  forecasts  of  functional  shopping 
destination  choices. 
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Reduction  of  Data  Requirements  and  the  Need  for  Statistically  Sound 
Modelling 

The  need  to  simplify  data  collection  needs  in  model  development . 
often' conflicts  with  the  requirements  for  good  statistical  modelling. 
The  finding  that  few  attitudinal  factors  need  be  considered  in  one 
model  is  in  a  good  direction  as  far  as  the  simplification  of  data 
collection  is  concerned.  To  obtain  more  reliable  and  sounder  models 
one  has  to  take  advantage  of  the  asymptotic  properties  of  the  joint 
distribution  of  model  parameter  estimators,  a  realization  of  which 
can  only  be  in  terms  of  large,  but  not  excessive,  sample  sizes.  It  is 
concluded  that,  in  order  to  achieve  a  suitable  compromise  and  yet 
cover  as  many  travel  market  segments  as  possible,  it  is  necessary  to 
use  separate  sampling  which  is  an  efficient  way  of  data  collection 
that  avoids  redundant  data  collection.  The  extension  of  the  multi- 
nomial response  relation  methodology  into  a  separate  sampling  context 
is  possible, but  the  discussion  of  the  latter  in  terms  of  logistic 
discrimination  is  for  the  reason  that,  with  the  standardization  of 
functional  destination  choice  sets,  the  multinomial  response  relation 
procedure  can  be  regarded  as  that  of  discrimination  with  a  common  func- 
tional choice  set. 

Recommendations  for  Further  Research 

The  following  are  some  areas  in  which  further  research  is 
recommended. 

1.   The  extension  of  the  multinomial  response  relation  approach 
to  cover  trip  generation  and  time  of  day  choices  and  the 
prediction  of  aggregate  demand. 
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Given  that  functional  travel  alternatives  relative  to  spatial 
choices  are  standardized,  research  is  needed  to  investigate 
the  extent  to  which  the  multinomial  response  relation  model 
can  be  extended  as  a  separate  sampling  logistic  discrimina- 
tion problem. 

A  large  scale  investigation  is  recommended  to  illustrate  the 
incorporation  of  the  multinomial  response  relation  model 
into  a  simulation  model  that  brings  the  analysis  to  specific 
shopping  destination  and  route  choices . 

Work  is  needed  in  defining  appropriate  shopping  area  attrac- 
tion factors  or  variables  that  are  related  to  destination 
choice  attributes  that  are  predominant  in  attitudinal 
surveys . 
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APPENDIX  A 

SAMPLE  CALCULATIONS  IN  THE  ESTIMATION  OF 

THE  PARAMETERS  OF  A  MULTINOMIAL  RESPONSE 

RELATION  MODEL 


This  Appendix  discusses  the  estimation  procedure  in  the  use  of 
the  multinomial  response  relation  model  for  analyzing  a  contingency 
table  that  defines  functional  zonal  shopping  destinations  for  people 
travelling  from  zones  with  an  average  zonal  income  of  $7001  -  $8000. 
A  discussion  of  functional  destination  choice  modelling  using  zonal 
shopping  destinations  is  given  in  Appendix  B.  The  functional  desti- 
nation contingency  table  for  three  categories  each  of  Zonal  Retail 
Floor  Area  and  Interzonal  Airline  Distance  is  shown  in  Figure  A.l. 

The  main  objective  is  to  predict  the  probability,  p.,  ,  with  which 
a  travel  market  segment  j  (j  =  l,...,n)  will  select  cell  h  (h  =  l,...,m) 
as  illustrated  in  Figure  A.l,  for  which  n  =  1  and  m  =  9.  The  proba- 
bility or  likelihood  associated  with  a  response  pattern  (ie  the  cell 
frequency  being  r.,  in  cell  h)  for  a  market  segment  j  is  assumed  to 
follow  from  the  multinomial  response  law  as  given  in  equation  5.1  of 
Chapter  5  by 

N-.  I  r..  r.-     r 

:1  „  j2      jm 


L.  =  7^ ; r   P-1   P-T  •••  P-  A'l 

j   r..!  r.„!...r.  !   ^il  m2     ^ira 


r^ i f-   p.,   p.i   ...  p. 

r.-!  r..!...r.  !   ^jl  ^j2     ^j 
Jl   ]2     jm 

The  total  likelihood  of  a  set  of  n  market  segments  is  given  by  ^     L^ 

The  maximum  likelihood  parameter  estimation  procedure  estimates 

parameters  that  can  be  used  to  determine  the  probabilities  p.^^  by 


n 


j  =  l  ' 
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maximizing  the  total  likelihood  function. 

The  probability  p.,  is  assumed  to  be  given  by  the  multiple  logit 
model  probability  law  as 

A. 2 


jh    Z.,    Z.-     Z. 
e    +  e   . . . +e 


where  Z.,  is  a  multivariate  logit  defined  as 
jh 

Zjh  =  ?j  •  §  •  ?h  A-2 

Ixq  qxt  txl 

where  x.  is  the  jth  row  of  a  design  matrix  X    associated  with  the 
-J  ~nxq 

market  segmentation  process  and  a  is  the  hth  column  of  the  design 
matrix  A    corresponding  to  the  classification  of  the  response  cate- 
gories.  The  X  and  A  matrices  have  been  discussed  in  detail  in  Chapter  5, 
An  element  of  the  row  vector  x.  is  0  or  1  according  as  the  effect 
corresponding  to  its  column  position  is  absent  or  present  when  consider- 
ing the  probability  associated  with  response  category  h  of  market 
segment  j .   Similarly  the  lth_  row  element  of  the  column  vector  a,  is  0 
or  1  according  as  the  effect  associated  with  the  Ith^  row  is  absent  or 
present . 

Each  element  &,  ,  of  the  matrix  g  is  the  parameter  corresponding  to 
the  kth  column  of  x  and  the  lth_  row  of  A. 

The  applications  described  in  this  research  develop  models  for  a 
single  market  segment  so  we  subsequently  will  discuss  the  sample  calcu- 
lations for  n  =  1  (ie  the  x  matrix  is  [1]).   We  therefore  replace  p., 

with  p,  and  Z.,  with  Z,    and  have 
^h      jh       h 

^h 

\  =  -z--^^ z-  A-^ 

12m 
e   +  e  +  .  .  .  e 
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where 

^h=  §   ?h 
Ixt  txl 

where  equation  A. 4  gives  the  expression  for  the  probability  associated 
with  response  category  h  determined  from  a  linear  effect  experimental 
model  expression  for  the  multivariate  logit  Z,  as  indicated  above. 
The  1th  element  of  a,  is  0  or  1  according  as  the  1th  effect  6.  is  absent 
or  present  in  the  definition  of  response  category  h  in  the  response 
matrix  A. 

An  extensive  discussion  has  been  given  in  Chapter  5  why  a  repara- 
meterized  form  of  the  model  in  equation  A. 5  has  to  be  used  in  the  esti- 
mation of  the  multivariate  logits  Z.  (h  =  l,...,m).  For  the  application 
under  discussion  for  which  n  =  1  we  have  the  following  linear  form  for 
the  reparameterized  model 

Z.   =  r,    T  A. 5 

-Ixm   -Ixs  ~  sxm 

*'^®''®  ?  Ixm^^  ^  vector  of  multivariate  logits  r    .^  ^  ,^g^^^^  ^^ 
parameters  in  the  reparameterized  model  discussed  further  in  a  later 

section.   T  is  a  row  basis  of  the  matrix  A  discussed  in  Chapter  5. 
sxm 

S  is  the  row  rank  of  the  matrix  A. 

For  the  contingency  table  given  in  Figure  A.l  the  effects  of  the 
three  Zonal  Retail  Floor  Area  are  denoted  by  F  ,  F^,  and  F^  and  those 
of  the  Interzonal  Airline  Distance  categories  by  D^,  D^  and  D^.  The 
cells  of  the  classification  in  Figure  A.l  are  numbered  as  follows 


DALC 
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168 

106 

58 

100 

69 

34 

120 

103 

70 

FLAC 
I 
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DALC 
I 
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ZONAL    RETAIL   FLOOR     AREA    4.    150 
150  ^    ZONAL    RETAIL    FLOOR     AREA   ^    350 

350  Z    ZONAL    RETAIL    FLOOR     AREA 
EACH    ZONAL  RETAIL    FLOOR  AREA    UNIT  =1000  sq.ft. 

INTERZONAL    AIRLINE    DISTANCE    ^    1.5 
1.5^    INTERZONAL    AIRLINE     DISTANCE    4.    3.5 

3.5^    INTERZONAL     AIRLINE     DISTANCE 
EACH    AIRLINE    DISTANCE    UNIT  =0.833  miles 


FIGURE  A.I    FUNCTIONAL    DESTINATION    CLASSIFICATION 

DEFINED    BY    ZONAL     RETAIL    FLOOR   AREA     AND 

INTERZONAL    AIRLINE    DISTANCE    FOR    AVERAGE 
ORIGIN     ZONE    INCOME    OF     $7001-^8000 
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•'l  ^2   ^3 


h 

1 

2 

3 

'2 

4 

5 

6 

h 

7 

8 

9 

m  =  9 


Denoting  the  general  mean  effect  by  y  the  A_  _  matrix  which  considers 
the  general  mean  and  the  main  effects  F^ ,  F  ,  F_,  D  ,  D-  and  D_  is 
given  by  (see  Chapter  5) : 


cell  no.- 


-7x9 


12  3  4  5  6  7  8 
11111111 
1110  0  0  0  0 
00011100 
0     0     0     0     0     0     1 


10  0  10  0 
0  10  0  10 
0     0     10     0     1 


9 
1 
0 
0 
1  1 
1  0  0 
0  1  0 
0     0     1 


Effect 
U 
F, 


The  1x7  parameter  vector  g  is  given  by 

^1x7  =  t^  h     ^2     h     ^1  °2  °3^ 

Applying  the  model  given  in  equation  A. 5  ,  the  multivariate  logit  Z 
corresponding  to  the  sixth  response  category  is  the  product  of  6  and 
the  sixth  column  of  A    ie 


Z,  = 


[yF^  ^2  ^3  ^1 


^2  ^3^ 


1 
0 
1 
0 
0 
0 

1 


U  -H  F2  +  D3 
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Z-  as  determined  above  indicates,  as  expected  ,  that  cell  6  is  affected 
by  the  general  mean  and  the  ¥     and  D  main  effects.   Given  that  the 
total  sample  size  N  is  constant,  if  two  marginal  frequencies  are  known 
for  each  way  of  classification  in  Figure  A.l,  the  third  marginal 
frequency  can  be  determined  by  subtraction.   For  this  reason,  each  way 
of  classification,  in  a  main  effects  model,  contributes  2  to  the  rank  of 
the  matrix  A.  The  rank  of  A  is  therefore  4.  A  more  detailed  discussion 
of  the  rank  of  A  is  given  in  Chapter  5.  The  implication  of  the  rank  of" 
A  being  4  is  that  only  4  parameters  can  be  estimated  in  the  reparameter- 
ized  main  effects  model  as  given  in  equation  A. 5. 

The  relationship  between  the  elements  of  the  1x4  parameter  vector 
r  and  the  1x7  parameter  vector  6  depends  on  the  form  of  the  row  basis  T 
of  A  that  is  selected  for  the  estimation  of  r.  The  construction  of  the 
basis  T  from  contrasts  of  the  effects  of  each  way  of  classification  in 
a  contingency  table  relative  to  one  category  effect  was  discussed  in 
Chapter  5.  The  model  results  are  indifferent  to  the  choice  of  basis,  a 
fact  which  will  be  shown  by  choosing  two  types  of  basis  T  -  one  with 
effects  considered  relative  to  the  first  category  effect  and  the  other 
with  the  effects  considered  relative  to  the  last  category  effect  of  each 
way  of  classification. 

Basis  T  Development  and  Parameter  Estimation 

Following  the  discussion  on  basis  construction  in  Chapter  5,  the 
rows  of  the  basis  T  for  the  contingency  table  in  Figure  A.l  are  formed 
from  the  Kronecker  products  of  the  columns  of  two  matrices  each  of  the 
form    Or  T =  [l^j  C_  „]  where  each  Q  matrix  represents  a  way  of 
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classification  in  Figure  A.l,l_  is  a  3x1  column  vector  each  component 
of  which  is  unity. 

C_  _  is  a  contrast  matrix  that  contrasts  the  main  effect  of  one 
category  with  each  of  the  remaining  classes. 

If  one  wishes  to  estimate  parameters  relative  to  the  first  category 
effect  the  rows  of  the  basis  T  are  formed  from  the  Kronecker  product  of 
the  following  matrices: 


1  = 


§1  = 


0 

1 

2 

J  = 

0 

1 

2 

1 

-1/3 

-1/3" 

"1 

-1/3 

-1/3 

1 

2/3 

-1/3 

%  = 

1 

2/3 

-1/3 

1 

-1/3 

2/3. 

.1 

-1/3 

2/3 

A. 6 


Denoting  the  Kronecker  product  of  column  i  of  Q,  and  column  j  of 
Q^  by  i,j,  the  four  rows  of  the  T  basis  for  the  main  effects  model  for 
the  contingency  table  in  Figure  A.l  are  given  as  follows,  (assuming  Q, 
and  Q-  represent  Retail  Floor  Area  and  Interzonal  Distance  respectively) 


ow 

Kronecker  Product 

Effec 

Indices 

1 

1,0 

^2-^ 

2 

2,0 

^3-^ 

3 

0,1 

D^-D^ 

4 

0,2 

^3-^1 

The  resulting  4x9  basis  T  is  shown  in  Table  A.l.  The  basis  given 
by  the  computer  program  is  multiplied  by  the  factor  that  is  indicated 
below  the  print-out  of  the  iflatrix  T.   In  subsequent  calculations  the 
matrix  T  printed  is  divided  by  the  multiplying  factor.   For  the  problem 
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Table  A.l  Sample  Calculations  for  a  Multinomial  Response  Relation 
Model 


FLAC  X  DALC  for  Avg.  Income  of  $7001  -  $8000 

T-  Matrix 

-3-3    -3   6    6  6 

-3   -3   -3   -3   -3  -3 

-3    6-3-3    6  -3 

-3   -3    6-3-3  6 

The  4  by  9  Matrix  has  a  Factor  of  9 

12    0    0 

0  0    12 


3 

-3 

-3 

6 

6 

6 

3 

6 

-3 

3 

-3 

6 

Kronecker  Product  Indices 


Iteration  No.  1 

Trial  Parameter  Matrix  -  TZ 

0.0000    0.0000 

0.0000 

0,0000 

Probability  Matrix 

Column         1 
5 
9 

2 
6 

3 

7 

4 
8 

Row  1        0.1111 

0,1111 

0.1111 

0,1111 

0.1111 

0,1111 

0,1111 

0,1111 

0.1111 

First  Derivative  of 

Log  Likelihood  -  DTZ 

-73.0000 

17,0000 

2.0000 

-114.0000 

Table  A.l  Cont. 
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Second  Derivative  of  Log  Likelihood  -  DDTZ 

-184.0000      92.0000    0.0000    0.0000 
-184.0000    0.0000    0.0000 
0.0000  -184.0000   92.0000 
0.0000   92.0000  -184.0000 


92.0000 
0.0000 
0.0000 
Correction  To  TZ 

4.6739  E-01 

1.4130  E-01 

3.9855  E-01 

8.1884  E-01 
Estimated  Matrix  -TZ 

-0.4674      -0.1413 
Iteration  No.  2 
Trial  Parameter  Matrix  -  TZ 

-0.4674      -0.1413 
Probability  Matrix 
Column 


-0.3986 


-0.3986 


Row  1 


1 
5 
9 

0.1898 

0.0798 

0.0726 


2 
6 


0.1274 
0.0524 


3 
7 


0.0837 
0.1648 


4 
8 


0.1189 
0.1106 


-0.8188 


-0.8188 
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Table  A.l  Cont. 


72.3744 

0.0000 

0.0000 

-187.8713 

0.0000 

0.0000 

0.0000 

-179.5164 

54.9338 

0.0000 

54.9338 

-136.7667 

First  Derivative  of  Log  Likelihood  -DTZ 

-4.9691 

4.8514 

14,8522 

-10.8503 

Second  Derivative  of  Log  Likelihood  -DDTZ 

-155.7334 

72.3744 

0.0000 

0.0000 

Correction  to  TZ 

2.4248  E-02 

-1.6482  E-02 

-6.6649  E-02 

5.2564  E-02 

Estimated  Matrix  TZ 

-0.4916      -0.1248      -0.3319      -0.8714 

Iteration  No.  3 

-0.4916      -0.1248      -0.3319      -0.8714 

Probability  Matrix 

Column  12  3  4 
5  6  7  8 
9 

Row  1    0.1877      0.1347      0.0785      0.1148 

0.0824      0.0480      0.1657      0.1189 

0.0693 
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Table  A.l  Cont. 


First  Derivative  of  Log  Likelihood 

-0.0344 

-0.0057 

-0.1647 

-0.1806 
Second  Derivative  of  Log  Likelihood  -DDTZ 


-153.2482 

71.8481 

0.0000 

0.0000 

71.8481 

-189,3193 

0.0000 

0.0000 

0.0000 

0.0000 

-184.7159 

54.4842 

0.0000 

0.0000 

54.4842 

-130.4142 

Correction  to  TZ 

2.9000  E-04 

1.4011  E-04 

1.4830  E-03 

2.0041  E-03 

Estimated  Matrix  TZ 

-0.4919 

-0.1250 

-0.3334 

-0.8734 

Covariance  Matrix  for  Parameter  Vector 

7.9377  E-03 

3.0124  E-3 

0.0000 

0.0000 
Predicted  Frequencies 

155.4     111.5     65.0     95.1     68.2     39.8     137.2     98.4     57.4 


3.0124  E-3 

0.0000 

0.0000 

6.4253  E-3 

0.0000 

0.0000 

0.0000 

6.1746  E-3 

2.5796  E-03 

0.0000 

2.5796  E-3 

8.7456  E-3 
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Table  A.l  Cont. 


Observed  Frequencies 

168.0  106.0  58.0  100.0  69.0  34.0  120.0  103.0  70.0 
Goodness  of  Fit  CHISQ  8.24  for  4  D.F. 
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of  Figure  A.l  this  factor  is  9,  The  Kronecker  product  indices  that 
went  into  the  development  of  the  basis  T  are  also  indicated  in  Table 
A.l. 

The  parameter  vector  r,  .   estimated  in  the  reparameterized  main 
effects  model  is  given  by 

^1x4  =  fcv^p.  cvV'  fW'  fv^i^i  ^•^ 

The  maximum  likelihood  estimation  procedure  maximizes  the  log  of 
the  likelihood  function  given  in  equation  A.l.  The  computer  program 
written  for  this  purpose  uses  the  Newton-Raphson  gradient  technique 
as  discussed  between  equations  5.13  and  5.15  of  Chapter  5.  The  esti- 
mation procedure  uses  the  first  derivative,  cp(r)  and  the  second  deriva- 
tive 1f(r)  of  the  log  of  the  likelihood  function  given  in  equation  A.l. 
All  the  derivatives  are  with  respect  to  the  elements  of  r. 

For  the  problem  of  Figure  A.l  under  discussion  we  have  from 
equation  5.9  of  Chapter  5, 

?3j^l(n  =  T   (r  -  NP)  A.8 

sxm   mxl 

where  r  and  P  are^   respectively,  mxl  vectors  of  observed  cell  frequen- 
cies in  the  contingency  table  under  consideration  and  cell  probabilities 
determined  from  estimated  logits  in  equation  A. 4.  N  is  the  total 
number  of  observations  or  individuals.  The  other  terms  in  equation  A.8 
have  been  previously  defined. 

The  matrix  of  second  derivatives  of  the  log  of  the  likelihood 
function  in  equation  A.l  is  given  from  equation  5.11  of  Chapter  5  by 

y  (r)  =  -  N  T   W  T"  A. 9 

sxs        sxm  mxm  mxs 
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where  W  is  the  variance  -  covariance  matrix  of  the  estimators  of 
P,  given  by 


Pl(l-Pp 

-^1^2  ••• 

-P,P 
1  m 

y^ 

P2(l-P2) 

"'2'- 

w    = 

mxm 

-Vi 

-V2      ••• 

Pm(l-V 

A. 10 


For  the  main  effects  model  for  the  problem  of  Figure  A.l 
s=4,m=9. 

The  following  equivalencies  in  the  results  given  in  Table 
A.l  result: 


r,   H  TZ 
-Ixs 

P   ,  =  Probability  Matrix  printed  out  as  a  row  vector  in  the 

computer  output  shown  in  Table  A.l. 

(g(r)  =   DTZ 

sxl 
^(r)  =  DDTZ 
sxs 

The  Newton-Raphson  estimation  procedure  uses  a  trial  parameter 

vector  and  computes  the  first  and  second  derivatives  of  the  log  of  the 

likelihood  function  of  equation  A.l  in  order  to  compute  corrections  to 

be  applied  to  the  assumed  parameter  vector.  The  corrected  parameter 

vector  becomes  the  trial  parameter  vector  for  the  next  iteration  until 

convergence  according  to  some  assigned  tolerance  is  achieved.   In  the 

estimation  program  used,  the  tolerance  is  such  that  the  absolute  value 

of  the  ratio  of  the  correction  to  the  parameter  value  must  be  less  than 

,0.005  for  each  element  of  the  estimated  parameter  vector: 
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Given  that  the  trail  parameter  vector  at  the  ith  iteration  is  r., 
the  trial  logits  are  given  by  equation  A. 5  as: 

z.   =  r.  T. 

hxm  -"■  ~ 
The  trial  probabilities  P.  (h  =  l,...,m)  are  computed  by  substituting 
the  elements  Z,  (h  =  l,...,m)  in  equation  A. 4.  The  trial  probabilities 
then  form  the  trial  probability  vector  P.  used  to  compute  the  log 
likelihood  first  and  second  derivatives  cp(rO  and  4'(r.)  respectively 
in  equations  A. 8  and  A. 9. 

The  vector  of  corrections  to  the  ith  trial  parameter  vector  is 
given  by  j 

6.  =  -'{'"■^(r.)  cp(r.)  A. 11 

The  (i+l)th  trial  parameter  vector  becomes 

f.  ,  =  f.  +  6. 
-1+1    -1   -1 

In  all  the  model  parameter  estimation  problems  considered  the 
estimation  program  starts  with  a  trial  parameter  vector  with  zero 
elements  for  iteration  1  as  shown  in  Table  A.l.   In  the  estimation  of 
the  parameters  for  the  main  effects  model  for  the  problem  of  Figure  A.l, 
convergence  is  achieved  after  3  iterations.  The  final  estimated  para- 
meter vector  and  its  covariance  matrix  are  given  by  the  estimation 
program.  The  covariance  matrix  has  been  shown  in  Chapter  5  to  be  given 
by  the  negative  of  the  inverse  of  the  matrix  of  second  derivatives  of 
the  log  of  the  likelihood  function  given  in  equation  A.l.  The  predicted 
cell  frequencies  are  given  by  the  product  of  the  total  sample  size  and 
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the  predicted  cell  probabilities.  The  parameters  estimated  in  Table 
A.l  are: 


F2  -  Fj  =  -0.4919 
Fj  -  Fj  =  -0.1250 


^2-^1 


-0.3334 


A.i: 


D  -  Dj  =  -0.8734 


The  goodness -of- fit  x  value  (discussed  later)  =  8.24  (4d.f.) 

In  order  to  estimate  a  set  of  parameters  that  are  relative  to  the 

last  category  effect,  a  different  T  basis  must  be  used,  developed  from 

Q-matrices  (see  equation  A, 6)  that  include  contrast  matrices  relative 

to  the  last  category  effect. 

The  appropriate  matrices  that  must  replace  Q^  and  Q-  in  equation 


J  = 


A. 14 


A, 6  are: 

i  =  0 

1 

2 

1 

2/3 

-1/3 

?1  = 

1 

-1/3 

2/3 

1 

-1/3 

-1/3 

0 

1 

2 

1 

2/3 

-1/3 

1 

-1/3 

2/3 

1 

-1/3 

-1/3 

Using  the  procedure  outlined  earlier  for  developing  the  basis  matrix 
T,  the  basis  corresponding  to  the  main  effects  model,  where 
effects  are  considered  relative  to  the  last  category  is: 


T 


=  1/9 


6   6   6-3-3-3   -3   -3  -3 

3-3-3   6   6   6-3-3  -3 

6-3-3   6  -3  -3   6   -3  -3 

3   6-3-3   6-3-3   6  -3 


A. 15 
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The  parameters  estimated  using  this  basis  are: 


F  -  F  =  0.1250 

F  -  F  =  -0.3670 

^  ^  "  A. 16 

D^  -  Dj  =  0.8734 

D^  -  Dj  =  0.5400 

2 
Goodness- of  fit  x   (discussed  later)  =  8.24, 


We 


estimate  F_  -  F^  and  D_  -  D^  from  (4  d.f.)  equation  A. 16  as: 


F2  -  F^  =  -0.4920 

D2  -  Dj  =  -0.3334 
The  predicted  cell  frequencies  given  by  the  parameter  vectors  whose 
elements  are  given  in  equations  A. 12  and  A. 16  are  also  exactly  the 
same.   It  is  now  clear  that  the  modelling  results  with  respect  to 
multivariate  logit  (and  hence  cell  probability)  estimation  are  invariant 
with  respect  to  the  choice  of  basis  T. 

2 
Likelihood  Ratio  Goodness  of  fit  x  • 

It  was  shown  in  Chapter  5  (see  equation  5.46)  that  with  large 

sample  maximum  likelihood  parameter  estimation  the  joint  distribution 

of  the  parameters  tends  to  a  multivariate  normal  distribution  with 

mean  vector  r  and  covariance  matrix  -4*   [r]  .  Under  this  assumption 

-2  X  the  log  of  the  likelihood  function  in  equation  A.l,  the  likelihood 

2 
ratio  statistic,  tends  to  a  x  variable  with  m-l-s  degrees  of  freedom. 

2 
This  goodness  of  fit  x  .  given  by 

-      m 
X  =-2  I  r^log^N  \lr^  A.  17. 

n=l 
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provides  a  test  for  how  well  an  estimated  parameter  vector  provides 
cell  frequencies  that  come  close  to  the  observed  frequencies. 


Estimation  of  a  Model  that  includes  an  Interaction  Effect. 

The  inclusion  of  an  interaction  effect  in  the  construction  of  the 
A  matrix  and  the  basis  T  [see  earlier  sections  of  this  Appendix]  were 
discussed  in  Chapter  5.  Referring  to  equations  A. 6  and  A. 16  the  esti- 
mation of  an  interaction  i,  j  in  addition  to  the  main  effects  estimated 
is  accomplished  by  increasing  the  row  rank  of  the  basis  T  by  adding  a 
row  to  the  main  effects  basis  formed  by  the  Kronecker  product  of 
columns  i  and  j  of  matrices  Q^  and  Q_.  An  interaction  i,  j  in  a  2-way 
contingency  table  means  the  interaction  between  the  itli  row  parameter 
and  the  jth  column  parameter  in  the  reparameterized  model.  Hence  a 
2,2  interaction  in  the  model  developed  with  equation  A. 6  is  the  interact- 
ion between  (F.-F^)  and  (D_-D-).   In  the  model  associated  with  equation 
A.  16  a  2,  2  interaction  is  the  interaction  between  (F_-F  ) . 

For  the  model  developed  from  equation  A. 6  the  5x9  T  basis  that 

includes  a  2,2  interaction  is  given  by: 

-3-3-3   6   6   6   -3   -3  -3 

-3   -3  -3   -3-3-3   6   6  6 

-3   6  -3   -3  -6  -3   -3   6  -3 

-3  -3   6-3-3   6  -3   -3  6 

1   1-2   1   1-2-2-2  4 

As  noted  above,  this  2.2  interaction  is  a  (F  -F^)  by  (D_-D,) 

interaction.   The  parameters  estimated  by  tlie  resulting  model  are: 


I5x9  =  1/9 
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D2  -  D 

D3-  D 

CF3-F 


=  -0.4919 
=  -0.0716 
=  "0.3334 
=  -0.8938 


)  by  (Dj  -  Dj)  =  0.4131 

Interaction 

2 
Goodness  of  fit  x  value  =  2.96  (3  d.f.) 

A  comparison  of  the  predicted  and  observed  response  patterns  is 
as  follows: 

Cell  No.  123456789 

Predicted  Frequency  160  115   57   98   70   35  130   93   70 
Observed  Frequency  168  106   58  100   69   34  120  103   70 

2 
The  reduction  in  x  value  due  to  the  2,2  interaction  compared  to 

the  main  effects  model  (see  Table  A.l)  =  8.24  -  2.96  =  5.28  which  is 

significant  at  the  5  percent  level.  This  significant  reduction  in 

2 
goodness  of  fit  x  value  is  accompanied  by  an  improved  set  of  predicted 

frequencies  compared  to  those  in  Table  A.l. 

Model  Parameter  Estimation  Cost 

Though  the  explanation  of  the  estimation  of  parameters  for  a 
multinomial  response  relation  model  may  be  lengthy,  the  estimation  pro- 
cedure is  very  simple  and  cheap.   The  models  developed  in  this  research 
rarely  cost  over  15  cents  per  run  once  a  compiled  version  of  the 
estimation  computer  program  is  available.  The  cost  of  compiling  the 
estimation  computer  program  is  less  than  40  cents. 
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APPENDIX  B 

FUNCTIONAL  ZONAL  SHOPPING  DESTINATION 
CHOICE  PREDICTION* 


It  has  been  discussed  in  Chapter  2  why  zonal  retail  floor  area 
or  retail  employment  are  inadequate  measures  of  shopping  attraction 
to  individuals.  The  need  to  agglomerate  physical  shopping  areas,  not 
necessarily  along  traffic  zonal  boundaries,  and  also  consider  more 
direct  characteristics  of  shopping  areas  has  been  established  in 
Chapters  2  and  6.  There  usually  is  no  information  on  what  activities 
were  classified  as  retail  in  which  case  the  zonal  measures  of  retail 
attraction  may  not  give  the  true  picture. 

The  objections  to  the  use  of  zonal  retail  floor  area  or  retail 
employment  are  much  more  serious  with  respect  to  the  modelling  of  indi- 
vidual shopping  travel  behavior  than  to  the  modelling  of  the  shopping 
behavior  of  groups  of  people.   It  should  therefore  be  possible  to  test 
a  few  essential  assumptions  hitherto  used  in  shopping  destination 
choice  modelling  with  data  using  zonal  measures  of  shopping  travel 
attraction  for  broad  market  segments. 


*A11  multinomial  response  models  discussed  in  this  Appendix  estimate 
parameters  relative  to  the  last  category. 
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The  data  to  be  used  is  on  home-based  one-way  shopping  automobile 
trip  data  assembled  from  a  1964  Indianapolis  Transportation  Study  data. 
Owing  to  the  large  number  of  individuals  involved  (2772)  and  the  expec- 
tation o£  a  random  distribution  of  households  in  a  given  zone,  the 
relative  geographical  location  of  households  and  zonal  shopping  desti- 
nations were  determined  from  Zonal  Centroid  Coordinates.  The  Zonal 
Centroids  were  defined  to  be  approximate  centers  of  the  zonal  land-use 
in  question.   The  functional  shopping  destinations  were  defined  in 
terms  of  a  cross-classification  of  Zonal  Retail  Floor  area  or  employ- 
ment and  interzonal  airline  distances.  Variables  available  for  market 
segmentation  were  Average  Zonal  Income,  Accessibility  to  Zonal  Retail 
Employment,  and  Percentage  single  family  Dwelling  units.  The  retail 
relative  accessibility  measures  were  determined  by  Nakkash  [  18  ]  for 
work  on  zonal  trip  generation  analysis,  using  friction  factors  of 
calibrated  gravity  models  for  trip  distribution.   In  spite  of  the  now 
celebrated  objections  to  the  use  of  the  gravity  model  for  such  a 
definition  of  accessibility,  one  should  get  an  idea  of  the  effect  of 
relative  accessibility  by  using  only  broad  ranges  of  Nakkash' s  measures. 
The  higher  the  relative  retail  accessibility  measure  for  a  given  zone 
the  easier  it  is  for  households  in  that  zone  to  reach  retail  land-use. 

The  Monotonicity  of  the  Effects  of  Shopping  Attraction  Measures  and 
Distance. 

Tables  B.l  and  B.2  display  responses  of  the  total  sample  to  func- 
tional zonal  shopping  destinations  defined  for  Airline  Distance  with 
Zonal  Retail  Floor  Area  and  Zonal  Retail  Employment  respectively.   The 
Airline  Distance  and  Shopping  attraction  measures  are  divided  into  six 
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Table  B.l.  Response  Prediction  for  Functional  Destinations  Defined  by 
the  Crossing  of  Zonal  Retail  Floor  Area  (FLAG)  and  Airline 
Distance  CDALC)a.b 


DALC 

1 

2 

3 

4 

5 

140 

216 

149 

84 

52 

I-I 

C159) 

(242) 

(146) 

(64) 

(30) 

138 

329 

148 

41 

52 

M 

(158) 

(329) 

(142) 

(41) 

(38) 

CJ 

34 

52 

36 

42 

13 

3 

to 

u. 

C28) 

(58) 

(40) 

(42) 

(8) 

71 

110 

76 

43 

27 

^ 

C48) 

(90) 

(82) 

(63) 

(45) 

44 

68 

56 

61 

17 

lO 

C34) 

(56) 

(56) 

(61) 

(39) 

Effects  Fitted:  Main  Effects  +  (1,1,  (1,3),  (2,3),  (4,2) 
and  (4,3)  Interactions 

Likelihood  Ratio  (L.R.)  Goodness-of-fit  X^:  91.61  (11  d.f.) 
^The  Classification  Categories  are  Defined  in  Figure  B.l 
^Cell  Entries  are  Predicted  Frequencies  With  the  Observed 
Values  in  Parenthesis. 
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Table  B.2.  Response  Prediction  for  Functional  Destinations  Defined  by 
the  Crossing  of  Zonal  Retail  Employment  and  Airline 
Distance^ »^ 


DALC 

1 

2 

3 

4 

5 

147 

240 

151 

61 

42 

iH 

(168) 

(257) 

(137) 

(54) 

(25) 

144 

276 

149 

64 

41 

C4 

(157) 

(276) 

(141) 

(64) 

(36) 

CJ 

63 

102 

65 

35 

18 

oj   fn 

■ 

(40) 

(117) 

(73) 

(35) 

(18) 

35 

57 

36 

15 

10 

«* 

(20) 

(36) 

(50) 

(21) 

(26) 

48 

78 

61 

41 

14 

lO 

(52) 

(68) 

(61) 

(41) 

(20) 

Effects  Fitted:  Main  Effect  +  (1,1).  (1,3),  (2,3),  (4,2) 
and  (4,3)  Interactions 

L.R.  Goodness-of-flt  x^'      74.97  (11  d.f.) 
^The  Classification  Categories  are  Defined  in  Figure  B.2 
^Cell  Entries  are  Predicted  Frequencies  with  Observed 
Values  in  Parenthesis. 
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categories  each  at  equal  successive  intervals  beyond  the  first  class  in 

2 
each  case.  The  goodness-of-fit  x  values  give  an  indication  of  the 

independence  of  the  factors  of  classification  and  the  monotonicity  in 

the  marginal  effects  of  the  factors  in  the  response  of  the  sample  to 

functional  zonal  shopping  destinations.   The  acute  lack  of  fit  implied 

2 
by  the  high  x  values  in  Tables  B.l  and  B.2  is  illustrated  in  the 

erratic  pattern  in  the  graphical  representation  of  Tables  B.l  and  B.2 

given  in  Figures  B.l  and  B.2.  Tables  B.l  and  B.2  show  the  first 

5  categories  of  each  classification. 


Table  B.3  shows  attempts  to  fit  up  to  5  interactions  in  addition 
to  main  effects  in  the  use  of  the  multinomial  response  relation  approach 
to  reproduce  the  response  in  the  5x5  contingency  tables  in  Tables  B.l 
and  B.2.   (the  multinomial  relation  approach  is  discussed  in  Chapter  5 
with  a  typical  example  of  its  estimation  procedure  given  in  Appendix  A) . 
All  the  models  in  Table  B.3  were  unsuccessful.  The  reason  for  the 
poor  fit  of  the  models  can  be  any  one  or  combination  of  the  following: 

1.  The  inadequacy  of  the  shopping  opportunity  measures  used. 

2.  Regularity  or  monotonicity  of  marginal  effects  not  applying 
over  a  large  number  of  intervals  for  shopping  attraction  and 
airline  distance  category  variables, 

3.  The  mixing  of  different  market  segments  that  may  respond  to 
functional  destinations  differently. 

4.  The  mixing  of  local  travel  with  primary  urban  travel  that  is  of 
more  importance  to  the  transportation  engineer  ,  as  discussed 
in  Chapter  2. 
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ZONAL  RETAIL  FLOOR    AREA  (FLA)  CLASSES 

FLAG 

I      


50  L 

-  150  L 

250  L 

350  L 

450  L 

(AREA  UNITS  IN  THOUSANDS  OF  SQ.FT.) 


AIRLINE 
DAUC 
I.      


DISTANCE  (DAL)    CATEGORIES 


CO 
UJ 

o 

UJ 

s 

q: 
u. 


bi     160 

Z 

o 

Q. 
(0 

UJ 


FLA  L  50 
FLA  L  150 
FLA  4  250 
FLA  L  350 
FLA  L  450 
FLA   ~ 


DAL  4  0.5 

0.5^    DAL  4   1.5 

I  5  ^  DAL  4.  2.5 

2.5 -<i  DAL  4r  3.5 

3.5^   DAL  ^4.5 

- -  4.5  z.   DAL 

(ONE  AIRLINE  DISTANCE    UNIT  =  0.833MILES) 


2  3  4  5 

AIRLINE  DISTANCE  CLASSES 


FIGURE  B.I  RESPONSE  FREQUENCY  DISTRIBUTIONS  FOR 
ZONAL  FUNCTIONAL  SHOPPING  DESTINATION 
CHOICE  -    FLAG  X  DALC 
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3601- 


ZONAL  RETAIL  EMPLOYMENT  fEMP)  CLASSES 
EMPC 


EMP  ^  50 

50    ^   EMP  ^150 

150  ^   EMP  ^250 

250  c    EMP  4350 

350   ^    EMP  4 450 

- -        450  ^    EMP 

(AREA  UNITS  IN  THOUSANDS  OF  SO  FT) 


AIRLINE    DISTANCE  (DAL)    CATEGORIES 
DALC 


(ONE  AIRLINE  DISTANCE    UNIT    =0.833  MILES) 


DAL  i  0.5 
0.5^  DAL  6  1.5 
I  .5  ^  DAL  ^  2.5 
2.5-^  DAL  1^  3.5 
3.5^  DAL^  4.5 
4.5^    DAL 


2  3  4  5 

AIRLINE  DISTANCE  CLASSES 


FIGURE  B.2  RESPONSE  FREQUENCY  DISTRIBUTIONS  FOR 
ZONAL  FUNCTIONAL  SHOPPING  DESTINATION 
CHOICE  -  EMPC  X  DALC 


236 


Table  B.3.  Likelihood  Ratio  Goodness-of-fit  -^   Values  for  Predicting 
Response  of  the  Total  Sample  to  Functional  Destinations 
Defined  in  5x5  Contingency  Tables 

Interactions  Fitted  in    Type  of  Contingency  Table    d.f .  for 
Addition  to  Main  Effect   j^^  x  DALC^  EMPC  x  DALC^   ^^  Value 


Main  Effect 

193.16 

91.34 

16 

1,3 

138.96 

89.54 

15 

1,3  2,3 

131.77 

89.25 

14 

1,3  2,3  4,2 

131.52 

88.55 

13 

1,3  2,3  4,2  4,3 

110.74 

77.42 

12 

1,3  2,3  4,2  4,3  1,1 

91.61 

74.97 

11 

^FLAC  and  DALC  are  Zonal  Retail  Floor  Area  and  Airline 
Distance  Categories  Defined  in  Figure  B.l. 
TMPC  and  DALC  are  Zonal  Retail  Employment  and  Airline 
Distance  Categories  Defined  in  Figure  B.2. 
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That  the  first  reason  should  not  seriously  affect  the  current 
investigation  has  been  previously  discussed.  The  second  and  third 
possible  reasons  will  be  investigated  by  reducing  the  number  of  cate- 
gories in  the  factors  for  defining  the  functional  destinations  and 
also  by  examining  response  patterns  for  different  market  segments 
defined  with  respect  to  Average  Origin  Zone  Income,  Relative  Accessi- 
bility to  Zonal  Retail  Floor  Area,  and  Percentage  Single  Family 
Dwelling  Units.  The  functional  destination  response  patterns  for  the 
market  segments  did  not  display  any  consistent  patterns  even  when  4x4 
contingency  tables  (the  class  values  increased  by  successively  equal 
intervals  starting  from  the  first  category)  were  used.   It  is  clear  that 
the  failure  of  the  definition  of  factor  categories  by  successive 
equal  intervals  is  an  indication  that  spatial  separation  and  shopping 
destination  attraction  factors  do  not  have  monotonic  or  regular  marginal 
effects  over  their  domains  of  occurrence.   This  brings  into  serious 
question  the  use  of  distance  and  attraction  variables  like  retail  floor 
area  or  retail  employment  as  if  such  monotonicity  prevailed  [See 
Chapter  4] . 

Functional  Destination  Prediction 

Further  results  to  be  discussed  involve  3x3  functional  destinations 
defined  with  either  Zonal  Retail  Floor  Area  or  Retail  Employment  and 
Airline  Distance  for  which  the  categories  are  defined  over  intervals 
that  are  not  necessarily  equal.  The  considerably  reduced  number  of 
categories  of  shopping  attraction  and  distance  measures  conforms  better 
to  the  parsimonious  expectation  that  people  do  not  usually  consider 
large  partitions  of  factors  in  their  response  to  the  urban  environment. 
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It  even  leaves  room  for  the  use  of  categories  like  "regional", 
"community",  and  "neighborhood"  shopping  areas  with  distance  or  travel 
time  measures  in  future  functional  destination  definition.  With  the 
reduced  numbers  of  functional  destination  categories  one  can  use  other 
attributes  of  shopping  areas  without  making  the  total  number  of  alter- 
natives in  the  classification  prohibitive. 

None  of  the  Relative  Accessibility  to  Retail  Floor  Area  and  Per- 
centage Single  Family  Dwelling  Units  market  segments  investigated  gave 
any  response  patterns  that  could  be  reproduced  well  in  all  the  remaining 
analyses  to  be  discussed.  Since  the  adequacy  of  the  definition  of  these 
factors  could  be  questioned  as  well,  their  further  use  in  market 
segmentation  is  not  discussed.   Only  Average  Origin  Zone  Income  (here- 
after referred  to  as  Average  Income)  will  be  considered  in  subsequent 
market  segmentation  discussion.  Four  Average  Income  categories  were 
defined  for  market  segmentation  -  (i)  _<  $6000  (ii)  $6001  -  $7000 
(iii)  $7001  -  $8000  and  (iv)  >  $8000.  The  higher  income  market  segments 

gave  consistently  reasonably  good  results  in  all  cases  considered  in 

2 
terms  of  the  level  of  the  goodness-of-fit  x  value  as  a  test  of  inde- 
pendence and  monotonicity  of  the  factors  for  defining  the  functional 

2 
destination  classifications.   The  Likelihood  Ratio  goodness-of-fit  x 

values  for  various  models  fitted^  using  different  effects,  are  summar- 
ized in  Table  B.4. 

To  attempt  to  exclude  local  travel  that  may  not  really  affect  the 
major  transportation  network,  models  were  developed  where  only  trips 
with  destination  Zone  Retail  Floor  Area  greater  than  50,000  sq.  ft.  or 
with  Interzonal  Airline  Distance  greater  than  1  distance  unit  (0.833 
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DALC 


1 

2 

3 

— 

CM 

ro 

FLAG 
I 
2 
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DALC 
I 

2 
3 


ZONAL    RETAIL    FLOOR     AREA   -e    150 
150  ^    ZONAL    RETAIL    FLOOR     AREA   ^    350 

350  ^    ZONAL    RETAIL    FLOOR      AREA 
EACH    ZONAL  RETAIL    FLOOR  AREA    UNIT  =1000  sq.  ft. 


AIRLINE     DISTANCE   ^    1.5 
1.5  -i    AIRLINE     DISTANCE  ^    3.5 

3.5-^    AIRLINE    DISTANCE 

EACH    AIRUNE   DISTANCE    UNIT  =0.833  miles 


FIGURE  B.3   FUNCTIONAL    ZONAL      SHOPPING     DESTINATION 
CATEGORIES     DEFINED     BY   RETAIL    FLOOR 
AREA    AND     AIRLINE     DISTANCE 
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DALC 
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a. 
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1 

2 

3 
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• 
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fO 

EMPC 

1  ZONAL  RETAIL   EMPLOYMENT    4.     300 

2  300  -^    ZONAL  RETAIL    EMPLOYMENT  ^     700 

3  700  ^   ZONAL  RETAIL    EMPLOYMENT 


DALC 
I 

2 
-     3 


AIRLINE     DISTANCE    ^    1.5 
L5^    AIRLINE     DISTANCE  ^    3.5 
3.5  ^    AIRLINE    DISTANCE 
EACH    AIRLINE    DISTANCE    UNIT  =  0.833  miles 


FIGURE  B.4  FUNCTIONAL    ZONAL      SHOPPING     DESTINATION 
CATEGORIES     DEFINED     BY  RETAIL    EMPLOYMENT 
AND     AIRLINE      DISTANCE 
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Table  B.4.  Likelihood  Ratio  Coodneasi-of-flt  y^alaea   for  Predicting  Responses  of  Subsamples 
to  Functional  Destinations  Defined  In  3  x  3  Contingency  Tables.  "^'^ 


Interaction 
Fitted  in 

Type  of 

Subsanple   and  Contingency  Table 

Market 

Total  Raw     FLA  >  50,000  ft2 

DAL  > 

1.0  Unit 

FLA  > 

50,000  ft2 

Segment 

Model  above 
Main  Effect 

Sampl 

e 

DAL  > 

1.0  unit 

(Sample  Size] 

FLAC 

IMPC 

FLAC 

EMPC 

FLAC 

EMPC 

FLAC 

EMPC 

X 

X 

X 

X 

X 

X 

X 

X 

DALC 

DALC 

DALC 

DALC 

DALC 

DALC 

DALC 

DALC 

^ 

194.04 

179.79 

157.12 

136.16 

82.18 

74.09 

81.03 

67.86 

1.1 

179.93 

175.73 

143.86 

132.72 

80.22 

73.26 

76.52 

65.26 

1,2 

186.63 

178.22 

155.99 

135.97 

81.45 

73.98 

80.93 

67.24 

Total 

2.1 

180.27 

163.78 

145.54 

122.16 

81.95 

74.09 

79.50 

67.13 

2.2 

137.09 

106.28 

121.55 

88.43 

52.79 

39.04 

61.95 

44.91 

Sample  Size 

2772 

2772 

2066 

2066 

1912 

1912 

1491 

1491 

Avg.  Income 

^ 

92.89 

80.62 

97.44 

83.10 

33.08 

27.38 

32.65 

24.40 

>  $6000 

1.1 

88.32 

77.32 

92.81 

79.74 

33.01 

25.56 

32.58 

22.70 

1.2 

77.76 

60.53 

84.08 

83.09 

27.17 

27.36 

28.08 

24.02 

2.1 

88.44 

74.86 

92.92 

77.16 

32.36 

25.85 

31.87 

22.67 

2.2 

64.57 

39.51 

71.03 

43.39 

20.60 

9.61 

21.89 

8.30 

Sample  Size 

537 

537 

382 

382 

316 

316 

227 

227 

Avg.  Income 

. 

138.45 

125.28 

103.58 

88.02 

53.35 

50.19 

39.60 

34.82 

$6001  - 

1.1 

129.33 

123.92 

95.49 

87.05 

46.31 

50.16 

31.42 

34.55 

$7000 

1.2 

137.91 

124.55 

103.27 

87.78 

53.33 

50.17 

38.92 

33.72 

2.1 

123.30 

109.54 

92.34 

76.05 

53.26 

49.43 

39.35 

33.50 

2,2 

83.29 

70.94 

67.97 

54.26 

32.85 

25.56 

29.13 

21.54 

Sample  Size 

757 

757 

543 

543 

490 

490 

376 

376 

Avg.  Income 

. 

8.24 

11.86 

8.49 

9.60 

31.23 

23.22 

40.71 

30.08 

$7001  - 

1.1 

8.22 

11.72 

8.36 

9.42 

31.22 

21.98 

40.56 

27.80 

$8000 

1.2 

6.85 

11.78 

5.66 

9.47 

29.25 

22.67 

37.49 

29.99 

2.1 

7.74 

11.75 

7.37 

9.13 

28.42 

23.20 

34.94 

29.36 

2,2 

2.96 

5.09 

5.17 

5.08 

22.17 

16.52 

33.06 

24.77 

Sample  Size 

828 

828 

658 

658 

580 

580 

469 

469 

Avg.  Income 

_ 

14.32 

10.95 

4.03 

2.54 

1.29 

1.34 

4.64 

2.71 

>  $8000 

1.1 

13.16 

10.73 

3.46 

2.47 

1.28 

1.12 

3.89 

2.69 

1,2 

12.42 

10.62 

3.98 

2.35 

0.87 

1.32 

4.55 

2.39 

2,1 

13.58 

9.62 

3.67 

1.81 

1.27 

1.32 

4.47 

2.01 

2,2 

13.51 

8.84 

4.03 

2.39 

1.27 

0.88 

4.62 

2.59 

Sample  Size 

650 

650 

483 

483 

526 

526 

419 

419 

^FIAC  and  DALC  are  Zonal  Retail  Floor  Area  (FLA)  and  Airline  Distance  (DAL)  Categories 
Defined  In  Figure  B.3 
2qipC  and  DALC  are  Zonal  Retail  Employment  and  Airline  Distance  Categories  Defined  In 
Figure  B.4 

^d.f.  -  4  for  Main  Effect  Models 
d.f.  •  3  for  Main  Effect  and  Single  Interaction  Models 
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miles)  and  where  a  combination  of  these  two  restrictions  was  used, 
were  considered. 

As  can  be  noticed  in  Table  B.4,the  Retail  Floor  Area  restriction 
improves  the  goodness-of-fit  for  all  cases,  making  that  for  the  high 
income  market  segments  extremely  good.  This  supports  the  contention 
that  local  travel  should  be  excluded  in  model  development,  as  discussed 
in  Chapter  2.   It  also  shows  that  the  shopping  behavior  of  higher 
income  families  (with  respect  to  major  retail  outlet  patronage)  who 
usually  live  in  outlying  areas  can  be  adequately  captured  by  using 
functional  destinations  defined  with  spatial  separation  and  shopping 
attraction  measures.   It  is  quite  evident  that  further  shopping  oppor- 
tunity or  attraction  measures  or  a  more  direct  definition  of  such  mea- 
sures is  needed  particularly  for  analyzing  the  shopping  behavior  of  the 
lower  income  market  segments. 

With  the  Interzonal  Airline  Distance  restriction  (excluding  trips 
with  interzonal  distance  less  than  1  distance  unit  (0.833  mile)), 
improvements  in  goodness-of-fit  are  achieved,  compared  to  the  total  raw 
sample,  for  all  income  categories  except  for  the  third.  This  could 
be  because  of  the  means  of  sample  screening  or  random  effects.   The 
nature  of  the  data  did  not  allow  a  further  investigation  of  this  obser- 
vation.  Similar  remarks  hold  for  the  case  where  the  Retail  Floor  Area 
and  Airline  Distance  restrictions  were  combined. 

The  ten  models  that  gave  good  results  in  functional  destination 
prediction,  as  discussed  above,  are  summarized  in  Tables  B.5  -  B.14. 
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Table  B.5,   Response  Prediction  for  Functional  Destinations  Defined  by 

the  crossing  of  Zonal  Retail  Floor  Area  and  Airline  Distance 
-  Total  Raw  Sample 

Avg.  Income  $7001  -  $8000 

Parameter  Std.  Deviation 

Row  (1):  -0.4919      (0.0893) 

Row  (2):  -0.0716       (0.0832) 

;3   tM    98    70   35      Col.  (1):  -0.3334      (0.0795) 

Col.  (2):  -0.8938      (0.0945) 

2*2 

Interaction:  0.4131    (0.1785) 

L.R.  Goodness-of-fit  x^J  2.96  (3  d.f.) 

lAll  Classification  Categories  Defined  in  Figure  B.3  and  B.4 

2cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 


DALC-"-'^ 
1     2 

I 
3 

160 

115 

57 

(168) 

(106) 

(58) 

98 

70 

35 

(100) 

(69) 

(34) 

130 

93 

70 

(120) 

(103) 

(70) 
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Table  B.6.  Response  Prediction  for  Functional  Destination  Defined  by 

the  Crossing  of  Zonal  Retail  Employment  and  Airline  Distance 
-  Total  Raw  Sample 

Avg.  Income  $7001  -  $8000 

DALC^'^  Parameter  Std.  Deviation 

1     2     3_ 

Row  (1):  -0.8398      (0.0970) 

Row  (2):  -0.0224       (0.0803) 

t)   j^    74    53   25      Col.  (1):  -0.3334      (0.0784) 

Col.  (2):  -0.9155      (0.0964) 

2,2 

Tnteraction:  0.04608   (0.1769) 


CO 


171 

122 

59 

(185) 

(114) 

(53) 

74 

53 

25 

(68) 

(53) 

(31) 

143 

103 

78 

(135) 

(111) 

(78) 

L.R.  Goodness-of-fit  x^:  5.09  (3  d.f.) 


IaII  Classification  Categories  Defined  in  Figures  B.3  and  B.4 
^Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B.7.  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Floor  Area  and  Airline 
Distance  -  Floor  Area  >  50,000  sq.  ft. 

Avg.  Income  $7001  -  $8000 

Parameter         Std,  Deviation 

Row  (1);  0,2256       CO. 1051) 

Row  C2):  0,6364        CO. 1002) 

3      ^         97    69   37      Col.  CD:  -0.3404      CQ.0886) 


m 


DALC^'' 
1            2 

I 
3 

78 

55 

29 

(85) 

(45) 

(32) 

97 

69 

37 

(100) 

(69) 

(34) 

130 

93 

70 

(120) 

(103) 

(70) 

Col.  (2):  -0.8561      (0.1081)  ■ 

2,2 

Interaction:  Q.3521    (0.1943) 

L.R.  Goodness-of-fit  ^x,^:   5.17  (3  d.f.) 

^All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 

2 
Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B.8.  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Employment  and  Airline 
Distance  -  Floor  Area  >  50,000  sq.  ft. 


es 


en 


Avg.  Income  $7001  -  $8000 


DALC 


1,2 


1 

2 

3 

90 

64 

32 

(102) 

(55) 

(29) 

71 

51 

26 

(68) 

(51) 

(29) 

144 

102 

78 

(135) 

(111) 

(78) 

Parameter 

Row  (1):  -0.2285 

Row  (2):  0.6074 

Col.  (1):  -0.3404 

Col.  (2):  -0.8855 

2,2 

Interaction:  0.4113 


Std.  Deviation 

(0.1101) 
(0.0956) 
(0.0886) 
(0.1120) 
(0.614) 


L.R.  Goodness-of-fit  x  •  5.08  (3  d.f.) 


^All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 
Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B,9,  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Floor  Area  and  Airline 
Distance  -  Floor  Area  >  50,000  sq.  ft. 


Avg.  Income  >  $8000 
DALC^'2 


1 

2 

3 

36 

46 

35 

(44) 

(39) 

(34) 

67 

CM 

87 

67 

(62) 

(91) 

(68) 

44 

CO 

57 

44 

(41) 

(60) 

(44) 

Parameter 

Row  CD:  0.6360 
Row  (2):  0.2146 
Col.  (1):  0.2566 
Col.  (2):  -0.0068 


Std.  Deviation 

(0,1142) 
CO. 1260) 
CO. 1098) 
CO. 1167) 


L.R.  Goodness-of-fit  x  :  4.03  (4  d.f.) 


All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 
Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B.IO.  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Employment  and  Airline 
Distance  -  Floor  Area  >  50,000  sq.  ft. 


<s 


Avg.  Income  >  $8000 


1 

DALC^' 
2 

2 
3 

43 

56 

43 

(49) 

(50) 

(43) 

42 

55 

42 

(43) 

(56) 

(40) 

61 

80 

61 

(55) 

(84) 

(63) 

Parameter 

Row  (1):  -0.0214 
Row  (2):  0.3524 
Col.  (1):  0.2566 
Col.  (2):  -0.0068 


Std.  Deviation 

(0.1191) 
(0.1092) 
(0.1098) 
(0.1168) 


..R.  Goodness-of-fit  x^:  2.54  (4  d.f.) 


^All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 
Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B,ll,  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Floor  Area  and  Airline 
Distance  -  Airline  Distance  >  1.0  unit 


Avg.  Income  >  $8000 

DALC^'2 
12     3 


39 

98 

70 

(43) 

(98) 

(66) 

36 

90 

65 

(32) 

(91) 

(68) 

24 

tn 

61 

43 

(24) 

(60) 

(44) 

Parameter 

Row  (1):  -0.0804 
Row  (2):  -0.4807 
Col.  (1):  0.9223 
Col.  (2):  0.587 


Std,  Deviation 

(0.1001) 
(0.1122) 
(0.1185) 
CO. 1251) 


L.R.  Goodness-of-fit  x^J  1-29  (4  d.f.) 

i 
^All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 

^Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B,  12.  Response  Prediction  for  Functional  Destinations  Defined 

by  the  Crossing  of  Zonal  Retail  Exployment  and  Airline  Dis- 
tance -  Airline  Distance  >  1.0  unit 


«n 


Avg.   Income  >   $8000 

DALC^'^ 
12  3 


43 

107 

77 

(44) 

(109) 

(74) 

23 

58 

42 

(26) 

(56) 

(41) 

33 

83 

60 

(29) 

(84) 

(63) 

Parameter 

Row  (1):  -0.6128 
Row  (2):  -0.2545 
Col.  (1):  0.9223 
Col.    (2):   0.5867 


Std.  Deviation 

(0.1119) 
(0.1002) 
(0.1187) 
(0.1251) 


L.R.  Goodness-of-fit  x  •  1.34  (4  d.f.) 


^All  Classification  Categories  Defined  in  Figures  B.3  and  B,4 
^Cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B,13.  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Floor  Area  and  Airline 
Distance  -Floor  Area  > 50,000  sq.  ft.  -  Airline  Distance 
>  1.0  unit 


Avg,  Income  >  $8000 


04 


cn 


1 

DALCl» 
2 

2 
3 

20 

45 

35 

(27) 

(39) 

(34) 

38 

87 

67 

(32) 

(91) 

(68) 

25 

58 

44 

(24) 

(60) 

(44) 

Parameter 

Row  (1);  0.6471 
Row  (2):  0.2469 
Col.  a):  0.8282 
Col.  C2):  0.5648 


Std.  Deviation 

CO.  1235) 
CO, 1331) 
(0.1314) 
CO. 1372) 


L.R.  Goodness-of-fit  x^J  4.64 

^All  Classification  Categories  Defined  in  Figures  B.3  and  B.4 
2cell  Entries  are  Predicted  Frequencies  with  Observed  Values  in  Paren- 
thesis 
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Table  B,14.  Response  Prediction  for  Functional  Destinations  Defined 
by  the  Crossing  of  Zonal  Retail  Employment  and  Airline 
Distance  -  Floor  Area  > 50,000  sq.  ft.  -  Airline  Distance 
>  1.0  unit 


Avg.  Income  >  $8000 


cs 


tn 


1 

DALC^' 
2 

2 

3 

24 

55 

42 

(28) 

(50) 

(43) 

24 

55 

43 

(26) 

(56) 

(40) 

35 

80 

61 

(29) 

(84) 

(63) 

Parameter 

Row  CD;  0.0082 
Row  (2):  0.3747 
Col.  (1):  0.8282 
Col.  (2):  0.5648 


Std.  Deviation 

CO, 1282) 
CO. 1180) 
CO. 1314) 
C0.1372) 


L.R.  Goodness-of-fit  x^-  2.71 

^All  Classification  Categories  Defined  in  Figures  B,3  and  B.4 

2 
Cell  Entries  are  Predicted  Frequencies  with.  Observed  Values  in  Paren- 
thesis 
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Some  Further  Modelling  Considerations 

Having  now  agreed  that  the  shopping  behavior  of  higher  income  mar- 
ket segments  can  be  sufficiently  investigated  by  using  the  available 
zonal  data  on  Retail  Floor  Area,  Employment  and  Airline  Distance,  one 
may  now  be  interested  in  testing  if  the  3rd  and  4th  market  segments 
differ  in  their  response  patterns  and  if  the  patterns  are  consistent. 
The  first  question  can  be  answered  by  examining  Table  B.4.   It  is 
noticed  that  where  one  high  income  category  gives  good  results  the 
models  for  the  other  category  are  poor.  Where  both  high  income  market 
segments  have  well-fitting  models,  one  achieves  that  with  main  effects 
only  while  the  other  does  so  with  main  effects  and  a  2  ,2  interaction. 
The  need  for  the  market  segmentation  adopted  is  therefore  justified. 


Consistency  of  Functional  Zonal  Destination  Choice  Models 

An  examination  of  the  signs  and  magnitudes  of  parameters  estimated 
for  the  well-fitting  models  discussed  above  (see  Tables  B.5  to  B.14) 
shows  patterns  that  cannot  be  easily  explained  with  information  avail- 
able except  for  the  strong  likelihood  that  it  is  a  result  of  the  use  of 
traffic  zonal  boundaries  for  defining  shopping  areas.  One  would  normally 
expect  that,  with  the  exclusion  of  local  travel,  the  patronage  of  shop- 
ping areas  would  increase  with  retail  floor  area  and  decrease  with 
distance.   Under  these  conditions  one  would  expect  that,  in  general,  the 
row  parameters  that  correspond  to  zonal  retail  floor  area  effects  would 
be  positive  and  that  the  column  parameters  representing  interzonal 
distance  effects  would  be  negative.   This  was  not  the  situation  in  all 
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the  models  summarized  in  Tables  B.5  to  B.14. 

Statistical  tests  were  carried  out  on  model  parameter  vectors, 
the  equality  of  which  for  any  pair  of  populations  would  establish 
similarity  of  response  patterns.   The  models  that  were  considered  are 
listed  in  Table  B.15.   The  multivariate  statistical  testing  procedures 
used  have  been  presented  in  Chapter  7.   70  percent  and  50  percent  sub- 
samples  were  taken  from  the  data  set  used  for  each  model  of  functional 
destination  choice.   The  original  total  sample,  the  70  percent  sub- 
sample,  and  the  50  percent  sub-sample  were  denoted  as  populations  I,  II 
and  III  respectively.  Comparisons  of  populations  I  with  II,  I  with  III, 
and  II  with  III  were  made  for  each  model.  All  the  tests  did  not  reject 
the  hypothesis  that  the  parameter  vectors  in  all  the  sub-samples 
(including  the  total  sample)  had  the  same  covariance  matrix.  However, 
the  inconsistency  in  the  signs  of  the  parameters  attributed  above  to  the 
inadequacy  of  the  use  of  zonal  shopping  destinations, prevailed  over  the 
models  developed  to  such  an  extent  that  the  hypothesis  of  the  equality 
of  parameter  vectors  was  rejected  in  all  cases. 

The  serious  inadequacy  of  the  use  of  zonal  shopping  areas  in 
functional  destination  choice  prediction  discussed  above  strongly  calls 
for  the  use  of  specific  shopping  areas  in  functional  shopping  destina- 
tion definition,  the  effectiveness  of  which  has  been  discussed  in 
Chapter  2  and  empirically  demonstrated  in  Chapter  7. 
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Table  B.15.  Functional  Zonal  Destination  Choice  Models  Considered 
in  Sub-Sample  Analysis 


Functional  Zonal 

Destination 

Classification-'- 

Origin  Zone 
Average  Income 
For  Sub-Sample 

Condition  for 

Subsamples 
Selectioi^  »^ 

FLAC  X  DALC 

$7001  -  $8000 

FLA 

> 

50 

EMPC  X  DALC 

$7001  -  $8000 

FLA 

> 

50 

FLAC  X  DALC 

Greater  Ttian 
$8000 

FLA 

> 

50 

EMPC  X  DALC 

Greater  Than 
$8000 

FLA 

> 

50 

FLAC  X  DALC 

Greater  Than 
$8000 

DAL 

> 

1.0 

EMPC  X  DALC 

Greater  Than 
$8000 

DAL 

> 

1,0 

FLAC  X  DALC 

Greater  Than 
$8000 

FLA 
DAL 

> 

> 

50 
1.0 

EMPC  X  DALC 

Greater  Than 
$8000 

FLA 
DAL 

> 
> 

50 
1.0 

1 
FLAC,  EMPC  and  DALC  are  Zonal  Retail  Floor  Area,  Zonal  Retail 

Employment  and  Inter-?onal  Airline  Distance  Categories  Defined  in 

Figures  B.l  and  B.2 

2 
FLA  is  a  Zonal  Retail  Floor  Area  Unit  of  1000  sq.  ft. 

%AL  is  an  Airline  Distance  Unit  of  0.833  miles 
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